Certificate of Mailing 

Date of Deposit: October 21. 2003 Label Number: EV 320805734 US 

I hereby certify under 37 C.F.R. § 1.10 that this correspondence is being deposited with the United States Postal Service as 
"Express Mail Post Office to Addressee" with sufficient postage on the date indicated above and is addressed to Mail Stop Patent 
Application, Commissioner for Patents, P.O. Box 1450, Alexandria, VA 2231 3-1 45Q 


Guv Beardslev 


Printed name of person mailing correspondence Signature of person mailing corrfepondence 


! of person mailing corr&pondence 


APPLICATION 
FOR 


UNITED STATES LETTERS PATENT 


APPLICANT: Sakari Kauppinen, Carsten Alsbo, Peter S. Nielsen, Daniel 

C. Jeffares, Tobias Mourier, Saren Merk, Peter Arctander, 
Niels Tommerup, Niels Tolstrup, and Henrik Vissing 

TITLE: Oligonucleotides Useful for Detecting and Analyzing Nucleic 

Acids of Interest 


F:\50287\50287.007002 UTILITY COVER SHEET.DOC 


PATENT 

ATTORNEY DOCKET NO. 50287/007002 


OLIGONUCLEOTIDES USEFUL FOR DETECTING AND ANALYZING NUCLEIC 
5 ACIDS OF INTEREST 
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15 50287.007002jmprediction.txt, 3 kB; 50287.007002_tmthermodynamic.txt, 47 kB, all created 
on October 19, 2003, has been submitted in duplicate and is hereby incorporated by reference. 

BACKGROUND OF THE INVENTION 
The invention relates to nucleic acids and methods for expression profiling of mRNAs, 
20 identifying and profiling of particular mRNA splice variants, and detecting mutations, deletions, 
or duplications of particular exons, e.g., alterations associated with a disease such as cancer, in a 
nucleic acid sample, e.g., a patient sample. The invention furthermore relates to methods for 
detecting nucleic acids by fluorescence in situ hybridization. 

The field of the invention is oligonucleotides (e.g., oligonucleotide arrays) that are useful 
25 for detecting nucleic acids of interest and for detecting differences between nucleic acid samples 
(e.g, such as samples from a cancer patient and a healthy patient). 

DNA chip technology utilizes minituarized arrays of DNA molecules immobilized on 
solid surfaces for biochemical analyses. The power of DNA microarrays as experimental tools 
relies on the specific molecular recognition via complementary base-pairing, which makes them 
30 highly useful for massive parallel analyses. In the post-genomic era, microarray technology has 
thus become the method of choice for many hybridization-based assays, such as expression 
profiling, SNP detection, DNA re-sequencing, and genotyping on a genomic scale. 
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Expression microarrays are capable of profiling gene expression patterns of tens of 
thousands of genes in a single experiment. Hence, this technology provides a powerful tool for 
deciphering complex biological systems, and thereby greatly facilitates research in basic biology 
and living processes, as well as disease diagnostics, theranostics, and drug development. In a 
5 typical cell, the mRNAs are distributed in three frequency classes: (i) superprevalent (10-20% of 
the total mRNA mass); (ii) intermediate (40-45%); and (iii) low-abundant mRNAs (40-45%). It 
is therefore of utmost importance that the dynamic range and sensitivity of the expression arrays 
are optimal, especially when analyzing expression levels of messages or mRNA splice variants 
belonging to the low-abundant class. 

10 The recent explosion of interest in DNA microarray technology has been sparked by two 

key innovations. The first was the use of non-porous solid support, such as glass or polymer as 
opposed to nylon or nitrocellulose filters, which has facilitated miniaturization and fluorescence- 
based detection. Roughly 20,000 cDNAs can be robotically spotted onto a microscope slide and 
hybridized with a double-labeled probe. The second was the development of methods for high- 

15 density spatial synthesis of oligonucleotides. The two key array technologies are outlined in the 
following. 

Oligonucleotide arrays 

An efficient strategy for oligonucleotide microarray manufacturing involves DNA 
synthesis on solid surfaces using combinatorial chemistry. Most of the current technology is 

20 developed by AfTymetrix and Rosetta Inpharmatics. Glass is currently preferred as the synthesis 
support because of its inert chemical properties and low level of intrinsic fluorescence as well as 
the ability to chemically derivatize the surface. Of the three approaches currently used to 
manufacture oligonucleotide arrays, the light-directed deprotection method is the most effective 
one in generating high density microchips. A single round of synthesis involves light-directed 

25 deprotection, followed by nucleotide coupling. Photolithographic masking is used to control the 
regions of the chip designated for illumination. Affymetrix uses a combination of 
photolithography and combinatorial chemistry to manufacture its GeneChip Arrays. Using 
technologies adapted from the semiconductor industry, GeneChip manufacturing begins with a 
5-inch square quartz wafer. Initially the quartz is washed to ensure uniform hydroxylation across 

30 its surface. The wafer is placed in a bath of silane, which reacts with the hydroxyl groups of the 
quartz and forms a matrix of covalently linked molecules. The distance between these silane 
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molecules determines the probes' packing density, allowing arrays to hold over 500,000 features 
within 1 .28 square centimeters. The principal disadvantage of this method is that a significant 
amount of chip design work and cost is associated with the mask design. Once a set of masks 
has been made, a large number of chips can be produced at a reasonable cost. The current 
5 pricing of oligonucleotide arrays available from Affymetrix are in the range of 5-10 fold more 
expensive than cDNA microarrays. 

DNA-DNA hybridization using oligonucleotide chips is clearly different from that of 
cDNA microarrays. Hybridizations involving oligos are much more sensitive to the GC content 
of individual heteroduplexes. In addition, single base mismatches have a pronounced effect on 
10 the hybridization reassociation of short oligos, and point mutations can thus be readily detected 
using oligo chips. 
cDNA microarrays 

cDNA microarrays containing large DNA segments such as cDNAs are generated by 
physically depositing small amounts of each DNA of interest onto known locations on glass 

1 5 surfaces. Two technologies for printing microarrays are (1) mechanical microspotting, and (2) 
ink-jetting. Mechanical microspotting has been extensively used at, e.g., Stanford University, 
and it utilizes pins or capillaries to deposit small quantities of DNA onto known addresses using 
motion control systems. Recent advances in microspotting technology using modem arraying 
robots allow for the preparation of 100 microarrays containing over 10,000 features in less than 

20 12 hours. A DNA arrayer is relatively easy to set up, and the cost is usually low compared to on- 
chip oligoarrayers. cDNA microarrays are capable of profiling gene expression patterns of tens 
of thousands of genes in a single experiment. To compare the relative abundance of the arrayed 
gene sequences in two DNA or RNA samples, e.g.j the total mRNA isolated from two different 
cell populations, the two samples are first labeled using two different fluorescent dyes such as 

25 Cy-3 and Cy-5. The labeled samples are mixed and hybridized to the clones on the array slide. 
After the hybridization, laser excitation of the incorporated, fluorescent target molecules yields 
an emission with a characteristic spectra, which is measured with a confocal laser scanner. The 
monochrome images from the scanner are imported to the software in which the images are 
pseudo-colored and merged. Data from a single hybridization is viewed as a normalized ratio in 

30 which significant deviations from the ratio are indicative of either increased or decreased 
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expression levels relative to the reference sample. Data from multiple experiments can be 
examined using any number of data mining tools. 
Current status of array technology 

It has now become clear that cDNA microarrays, originally developed by Pat Brown and 
5 co-workers at the Stanford University, are sensitive, but may not be sufficiently specific with 
respect to, e.g., discrimination of homologous transcripts in gene families and alternatively 
spliced isoforms. On the other hand, the Affymetrix GeneChip system is specific, but may not 
be sensitive enough. This lack of sensitivity may explain why Affymetrix uses 16x 26-mer 
perfect match capture probes together with 16x25-mer mismatch probes per transcript in its 

10 expression profiling chips resulting in enormous data sets in genome-wide arrays. Therefore, the 
functional genomics field is in the process of switching, as they run out of samples, from existing 
PCR-amplified cDNA fragment libraries for microarraying to custom longmer oligonucleotide 
arrays comprising transcript-specific oligonucleotide capture probes typically in the range of 30- 
mers to 80-mers, thus addressing both specificity and sensitivity. 

15 A Iternative splicing 

As the field of genomics research is shifting from the acquisition of genome sequences to 
high-throughput functional genomics, there is an increasing need to understand the dynamics 
within the genetic regulation as well as RNA and protein sequences in order to elucidate gene 
expression in all its complexity. A common feature for eukaryotic genes is that they are 

20 composed of protein-encoding exons and introns. Introns (intra-genic-regions) are non-coding 
DNA which interrupt the exons. Introns are characterized by being excised from the pre-mRNA 
molecule in RNA splicing, as the sequences on each side of the intron are spliced together. RNA 
splicing not only provides functional mRNA, but is also responsible for generating additional 
diversity. This phenomenon is called alternative splicing, which results in the production of 

25 different mRNAs from the same gene. The mRNAs that represent isoforms arising from a single 
gene can differ by the use of alternative exons or retention of an intron that disrupts two exons. 
This process often leads to different protein products that may have related or drastically 
different, even antagonistic, cellular functions. There is increasing evidence indicating that 
alternative splicing is very widespread (Croft et al Nature Genetics, 2000). Recent studies have 

30 revealed that at least 60% of the roughly 35,000 genes in the human genome are alternatively 
spliced. Clearly, by combining different types of modifications and thus generating different 
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possible combinations of transcripts of different genes, alternative splicing is a potent 
mechanism for generating protein diversity. Analysis of the spliceome, in turn, represents a 
novel approach to both functional genomics and pharmacogenomics. 
Antisense transcription in eukaryotes 
5 RNA-mediated gene regulation is widespread in higher eukaryotes and complex genetic 

phenomena like RNA interference, co-suppression, transgene silencing, imprinting, methylation, 
and possibly position-effect variegation and transvection, all involve intersecting pathways based 
on or connected to RNA signalling (Mattick 2001; EMBO reports 2, 1 1 : 986-991). Recent 
studies indicate that antisense transcription is a very common phenomenon in the mouse and 

10 human genomes (Okazaki et al. 2002; Nature 420: 563-573; Yelin et al. 2003, Nature 

Biotechnol.). Thus, antisense modulation of gene expression in e.g. human cells may be a 
common regulatory mechanism. In light of this, the present invention provides novel tools, in 
which non-naturally occurring nucleic acids, such as LNA oligonucleotides, can be designed to 
silence or modulate the regulation of a given mRNA by non-coding antisense RNA, by designing 

15 a complementary sense LNA oligonucleotide for the regulatory antisense RNA. This has a high 
potential in target identification, target validation and therapeutic use of LNA oligonucleotides as 
modulating and silencing sense nucleic acid agents. 
Misplaced control of alternative splicing can cause disease 

The detection of the detailed structure of all transcripts is an important goal for molecular 

20 characterization of a cell or tissue. Without the ability to detect and quantify the splice variants 
present in one tissue, the transcript content or the protein content cannot be described accurately. 
Molecular medical research shows that many cancers result in altered levels of splice variants, so 
an accurate method to detect and quantify these transcripts is required. Mutations that produce 
an aberrant splice form can also be the primary cause of such severe diseases such as spinal 

25 muscular dystrophy and cystic fibrosis. 

Much of the study of human disease, indeed much of genetics is based upon the study of 
a few model organisms. The evolutionary stability of alternative splicing patterns and the degree 
to which splicing changes according to mutations and environmental and cellular conditions 
influence the relevance of these model systems. At present, there is little understanding of the 

30 rates at which alternative splicing patterns change, and the factors influencing these rates. Table 
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1 shows a set of genes that are known to be alternatively spliced and that are orthologs of known 
human disease genes. 

Table 1. C. elegans disease orthologs that are known to be differentially spliced in C elegans. 
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Disease 

C. elegans gene 

BLAST £ value 

brABLl 

M79.1A 

l.OOE-162 

X-Linked Lymphoprol.-SH2Dl A 

M79.1A 

2.00E-58 

Cyclin Dep. Kinase 4-CDK4 

F18H3.5A 

l.OOE-124 

HNPCC*-PMS2 

H12C20.2A 

l.OOE-123 

Neurofibromatosis 2-NF2 

C01G8.5A 

5.00E-163 

Duchenne MD+-DMD 

F32B4.3A 

O.OOE+00 

Cofnn-Lowry-RPS6KA3 

T01H8.1A 

2.00E-13 

Septooptic Dysplasia-HESXl 

Y113G7A.6A 

l.OOE-152 

Non-Insulin Dep. Diabet.-PCSKl 

F11A6.1A 

l.OOE-166 

Bartter's-SLC12A1 

Y37A1C.1A 

l.OOE-167 

Gitelmans-SLC12A3 

Y37A1C.1A 

O.OOE+00 

Hered. Spherocytosis- ANKl 

B0350.2A 

l.OOE-09 

Darier-White-SERCA 

K11D9.2A 

O.OOE+00 

Spondyloepip.Dysp .-C0L2A1 

F01G12.5A/let-2 

9.00E-20 


Previously, other microarray analyses have been performed with the aim of detecting 
either splicing of RNA transcripts per se in yeast, or of detecting putative exon skipping splicing 
events in rat tissues, but neither of these approaches had sufficient resolution to estimate 
10 quantities of splice variants, a factor that could be essential to an understanding of the changes in 
cell life cycle and disease. 

Thus, improved methods are needed for nucleic acid amplification, hybridization, and 
classification. Desirable methods can distinguish between mRNA splice variants and quantitate 
the amount of each variant in a sample. Other desirable methods can detect differences in 
1 5 expressions patterns between patient nucleic acid samples and nucleic acid standards. 
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SUMMARY OF THE INVENTION 

The present invention demonstrates the usefulness of LNA-modified oligonucleotides in 
the construction of highly specific and sensitive microarrays for expression profiling (e.g., 
5 mRNA splice variant detection) and comparative genomic hybridization. The invention provides 
novel technology platforms based on nucleic acids with LNA or other high affinity nucleotides 
for sensitive and specific assessment of altemative splicing using microarray technology. As 
opposed to high-density cDNA or DNA oligonucleotide microarrays, LNA microarrays are able 
to discriminate between highly homologous as well as differentially spliced transcripts. The 

10 invention furthermore provides methods for highly sensitive and specific nucleic acid detection 
by fluorescence in situ hybridization using LNA-modified oligonucleotides. The present methods 
greatly facilitate the analysis of gene expression patterns from a particular species, tissue, cell 
type. The analysis of the human spliceome provides important information for 
pharmacogenetics. Thus, the present methods are highly valuable in medical research and 

15 diagnostics as well as in drug development and toxicological studies. 

In general, the invention features populations of high affinity nucleic acids that have 
duplex stabilizing properties and thus are useful for a variety of nucleic acid detection, 
amplification, and hybridization methods (e.g., expression or mRNA splice variant profiling). 
Some of these oligonucleotides contain novel nucleotides created by combining specialized 

20 synthetic nucleobases with an LNA backbone, thus creating high affinity oligonucleotides with 
specialized properties such as reduced sequence discrimination for the complementary strand or 
reduced ability to form intramolecular double stranded structures. The invention also provides 
improved methods for identifying nucleic acids in a sample and for classifying a nucleic acid 
sample by comparing its pattern of hybridization to an array to the corresponding pattern of 

25 hybridization of one or more standards to the array (e.g., comparative genomic hybridization). 

Other desirable modified bases have decreased ability to self-anneal or to form duplexes 
with oligonucleotides containing one or more modified bases. The invention also provides 
arrays of nucleic acids containing these modified bases that have a decreased variance in melting 
temperature and/or an increased capture efficiency compared to naturally-occurring nucleic 

30 acids. These arrays as well as the oligonucleotides in solution can be used in a variety of 
applications for the detection, characterization, identification, and/or amplification of one or 
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more target nucleic acids. These oligonucleotides and oligonucleotides of the invention in 
general can also be used for solution assays, such as homogeneous assays. 

Merged Probes 

In one aspect, the invention features a non-naturally-occurring nucleic acid with a 
5 melting temperature that is at least 3, 5, 8, 10, 12, 15, 20, 25, 30, 35, or 40X higher than 
that of the corresponding control nucleic acid with 2'-deoxynucleotides. The nucleic acid 
hybridizes to a first region within a first exon of a target nucleic acid and to a second 
region within a second exon of the target nucleic acid that is adjacent to the first exon. 

In a related aspect, the invention provides a non-naturally-occurring nucleic acid 
10 with a melting temperature that is at least 3, 5, 8, 10, 12, 15, 20, 25, 30, 35, or 40°C 

higher than that of the corresponding control nucleic acid with 2 -deoxynucleotides. The 
nucleic acid hybridizes to a first region within an exon of a target nucleic acid and to a 
second region within an intron of the target nucleic acid that is adjacent to the exon. 

In another aspect, the invention features a non-naturally-occurring nucleic acid 
15 with a melting temperature that is at least 3, 5, 8, 10, 12, 15, 20, 25, 30, 35, or 40°C 

higher than that of the corresponding control nucleic acid with 2'-deoxynucleotides. The 
nucleic acid hybridizes to a first region within a first intron of a target nucleic acid and to 
a second region within a second intron of the target nucleic acid that is adjacent to the 
first intron. 

20 In yet another aspect, the invention provides a nucleic acid that is a non-naturally- 

occurring nucleic acid with a capture efficiency that is at least 10, 25, 50, 100, 150, 200, 
500, 800, 1000, or 1200% greater than that of a corresponding control nucleic acid with 
2'-deoxynucleotides at the temperature equal to the melting temperature of the nucleic 
acid. The nucleic acid hybridizes to a first region within a first exon of a target nucleic 

25 acid and to a second region within a second exon of the target nucleic acid that is 
adjacent to the first exon. 

In a related aspect, the invention features a nucleic acid that is a non-naturally- 
occurring nucleic acid with a capture efficiency that is at least 10, 25, 50, 100, 150, 200, 
500, 800, 1000, or 1200% greater than that of a corresponding control nucleic acid with 

30 2'-deoxynucleotides at the temperature equal to the melting temperature of the nucleic 
acid. The nucleic acid hybridizes to a first region within an exon of a target nucleic acid 


and to a second region within an intron of the target nucleic acid that is adjacent to the 
exon. 

In yet another aspect, the invention provides a nucleic acid that is a non-naturally- 
occurring nucleic acid with a capture efficiency that is at least 10, 25, 50, 100, 150, 200, 
5 500, 800, 1000, or 1200% greater than that of a corresponding control nucleic acid with 
2'-deoxynucleotides at the temperature equal to the melting temperature of the nucleic 
acid. The nucleic acid hybridizes to a first region within a first intron of a target nucleic 
acid and to a second region within a second intron of the target nucleic acid that is 
adjacent to the first intron. 
10 In desirable embodiments, the nucleic acids of the invention featuring a non- 

naturally occurring nucleic acid exhibit increased duplex stability due to slower rates of 
dissociation of the nucleic acid complexes (the off-rate) (Christensen et al 2001, 
Biochem. J. 354: 481-484). 

In one aspect of the invention the structure of desirable adenosine, thymine, 
15 guanine and cytosine analogs are those disclosed in PCT Publication No. WO 97/12896, 
Formula 5, 6, 7, 8, 9, 10, 1 1, 12 and 13. These modified bases may be incorporated as 
part of an LNA, DNA, or RNA unit and used any of the oligomers of the invention. 

In still another aspect, the invention features a nucleic acid that is an LNA (i.e., a 
nucleic acids with one or more LNA units) and that hybridizes to a first region within a 
20 first exon of a target nucleic acid and to a second region within a second exon of the 
target nucleic acid that is adjacent to the first exon. 

In another aspect, the invention features a nucleic acid that is an LNA and that 
hybridizes to a first region within an exon of a target nucleic acid and to a second region 
within an intron of the target nucleic acid that is adjacent to the exon. 
25 In one aspect, the invention provides nucleic acid that is an LNA and that 

hybridizes to a first region within a first intron of a target nucleic acid and to a second 
region within a second intron of the target nucleic acid that is adjacent to the first intron. 

In desirable embodiments of any of the above aspects, the length of the segment 
of the nucleic acid hybridizing to the first region and the length of the segment of the 
30 nucleic acid hybridizing to the second region are between 3 and 50 nucleotides, 1 0 and 40 
nucleotides, or 20 and 30 nucleotides, inclusive. The length of the segment of the nucleic 
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acid hybridizing to the first region and the length of the segment of the nucleic acid 
hybridizing to the second region may be the same length or different lengths. Desirably, 
the nucleic acid containing LNA units are symmetrically spaced on both sides of a 
junction between either two exons, an exon and an intron, or two introns, or alternatively, 
5 the nucleic acid containing LNA units are spaced on both sides of a junction based on 
equalized duplex melting temperatures of the segments. Desirably, the nucleic acid has 
one or more LNA units within 5, 4, 3, 2, or 1 nucleotides of a junction between either two 
exons, an exon and an intron, or two introns. 

In another aspect, the invention features a population of nucleic acids that 
10 includes one or more nucleic acids of any one of the above aspects. 
Internal Probes 

In another aspect, the invention features a non-naturally-occurring nucleic acid 
with a melting temperature that is at least 3, 5, 8, 1 0, 1 2, 1 5, 20, 25, 30, 35, or 40^*0 
higher than that of the corresponding control nucleic acid with 2'-deoxynucleotides. The 
1 5 nucleic acid hybridizes to only one exon or to only one intron of a target nucleic acid. 

In a related aspect, the invention features a non-naturally-occurring nucleic acid 
with a capture efficiency that is at least 10, 25, 50, 100, 150, 200, 500, 800, 1000, or 
1200% greater than that of a corresponding control nucleic acid with 2'-deoxynucleotides 
at the temperature equal to the melting temperature of the nucleic acid. The nucleic acid 
20 hybridizes to only one exon or to only one intron of a target nucleic acid. 

In another aspect, the invention features a nucleic acid that is an LNA and that 
hybridizes to only one exon or to only one intron of a target nucleic acid. 

In desirable embodiments of the above aspects for nucleic acids that hybridizes to 
only one exon or only one intron, the nucleic acid does not hybridize to both an exon and 
25 an intron. 

In another aspect, the invention features a population of nucleic acids that 
includes one or more nucleic acids of any one of the above aspects. 
Pharmaceutical Compositions and Nucleic Acid Populations 

In another aspect, the invention features a pharmaceutical composition that includes one or 
30 more of the nucleic acids of the invention and a pharmaceutically acceptable carrier, such as one 
of the carriers described herein. 
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In another aspect, the invention features a population of two or more nucleic acids of the 
invention. The populations of nucleic acids of the invention may contain any number of unique 
molecules. For example, the population may contain as few as 10, 10^ 10"*, or 10^ unique 
molecules or as many as io\ lo^ 10^ or more unique molecules. In desirable embodiments, at 
5 least 1,5, 10, 50, 100 or more of the polynucleotide sequences are a non-naturally-occurring 
sequence. Desirably, at least 20, 40, or 60% of the unique polynucleotide sequences are non- 
naturally-occurring sequences. Desirably, the nucleic acids are all the same length; however, 
some of the molecules may differ in length. 
Desirable Embodiments of Any of the Above Aspects 

10 In desirable embodiments of any of the above aspects, the length of one or more 

nucleic acids (e.g., nucleic acids in a nucleic acid population of the invention) is between 
15 and 150 nucleotides, 5 and 100 nucleotides, 20 and 80 nucleotides, or 30 and 60 
nucleotides in length, inclusive. In particular embodiments, the nucleic acid is 6, 7, 8, 9, 
10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 30, 40, 50 nucleotides or at least 60, 70, 80, 90, 

15 100, 120, or 130 nucleotides in length. In additional embodiments, the nucleic acid is 
between 8 and 40 nucleotides, such as between 9 and 30, or 12 and 25, or 1 5 and 20 
nucleotides. Desirably, at least 5, 10, 15, 20, 30, 40, 50, 60, or 70% of the nucleotides in 
the nucleic acid are LNA units. In desirable embodiments, every second nucleotide, 
every third, every fourth nucleotide, every fifth nucleotide, or every sixth nucleotide in 

20 the nucleic acid is an LNA unit. In various embodiments, (i) every second and every 
third nucleotide, (ii) every second and every fourth nucleotide, (iii) every second and 
every fifth nucleotide, (iv) every second and every sixth nucleotide, (v) every third and 
every fourth nucleotide, (vi) every third and every fifth nucleotide, (vii) every third and 
every sixth nucleotide, (viii) every fourth and every fifth nucleotide, (ix) every fourth and 

25 every sixth nucleotide, and/or (x) every fifth and every sixth nucleotide in the nucleic 

acid is an LNA unit. Desirably, every second, every third, and every fourth nucleotide in 
the nucleic acid is an LNA unit. In desirable embodiments, the nucleic acids of the 
invention have one or more of the following substitution patterns which is repeated 
throughout the nucleic acids: XxXx, xXxX, XxxXxx, xXxxXx, xxXxxX, XxxxXxxx, 

30 xXxxxXxx, xxXxxxXx, or xxxXxxxX in which "X" denotes an LNA unit and "x" 
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denotes a DNA or RNA unit. In some embodiments, the nucleotides that are not LNA 
units are naturally-occuring DNA or RNA nucleotides. 

In various embodiments, the nucleic acid comprises two or more contiguous LNA 
units. Desirably, the nucleic acid comprises at least 2, 3, 4, 5, 6, 7, or 8 contiguous LNA 
5 units. In desirable embodiments, the number of contiguous LNA units is between 5 and 
20% or 10 and 15% of the total length of the nucleic acid. In a particular embodiment, 5 
contiguous nucleotides of a 50-mer merged probe are LNA units. In one embodiment, 
the nucleic acid does not have greatly extended stretches of modified DNA or RNA 
residues, e.g. greater than about 4, 5, 6, 7, or 8 consecutive modified DNA or RNA 

10 residues. According to this embodiment, one or more non-modified DNA or RNA units 
are present after a consecutive stretch of about 3, 4, 5, 6, 7, or 8 modified nucleic acids. 

Other desirable nucleic acids have an LNA substitution pattern that results in the 
formation of negligible secondary structure by the nucleic acids with itself In one such 
embodiment, the nucleic acids do not form hairpins or do not form other secondary 

1 5 structures that would otherwise inhibit or prevent their binding to a target nucleic acid. 
Desirably, opposing nucleotides in a palindrome pair or opposing nucleotides in inverted 
repeats or in reverse complements are not both LNA units. In various embodiments, the 
nucleic acids in the first population form less than 3, 2, or 1 intramolecular base-pairs or 
base-pairs between two identical molecules. In desirable embodiments, the nucleic acid 

20 does not have LNA-5-nitroindole: LNA-5-nitroindole intramolecular base-pairs. 

In other desirable embodiments, at least one LNA unit (e.g., at least 2, 3, 4, 5, 6, 
7, 8, 9, or 10 LNA units) in the nucleic acid hybridizes to a first region within a first exon 
of a target nucleic acid and at least one LNA unit (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, or 10 
LNA units) in the nucleic acid hybridizes to a second region within a second exon of the 

25 target nucleic acid that is adjacent to the first exon. The number of LNA units that bind 
to each region can be the same or different. In some embodiments, the 5' terminal 
nucleotide of the nucleic acid is or is not an LNA unit. Desirably, the 3' terminal 
nucleotide of the nucleic acid is not an LNA unit (e.g., the nucleic acid may contain a 3' 
terminal naturally-occurring nucleotide). 

30 Desirably, the nucleic acid can distinguish between different nucleic acids (e.g., mRNA 

splice variants) that cannot be distinguished using a naturally-occurring control nucleic acid 
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(e.g., a control nucleic acid that consists of only 2*-deoxynucleotides such as a control nucleic 
acid of the same length as the nucleic acid of the invention). Desirably, the hybridization 
intensity of the nucleic acid for an exon of interest is at least 2, 3, 4, 5, 6, or 10 fold greater than 
the hybridization intensity of the nucleic acid for another exon in the same target nucleic acid 
5 (e.g., mRNA) or in another nucleic acid. Desirably, the hybridization intensity of the nucleic 
acid for target nucleic acid is at least 2, 3, 4, 5, 6, or 10 fold greater than the hybridization 
intensity for a non-target nucleic acid with less than 99, 95, 90, 80, 70, or 60% sequence identity 
to the target nucleic acid. 

Desirably, all of the nucleic acids of the population or all of the nucleic acids of a 

10 subpopulation of the population are the same length, hi some embodiments, the 
population includes one or more nucleic acids of a different length. In some 
embodiments, longer nucleic acids contain one or more nucleotides with universal bases. 
For example, nucleotides with universal bases can be used to increase the thermal 
stability of nucleic acids that would otherwise have a thermal stability lower than some or 

15 all of the nucleic acids in the population. In some embodiments, one or more nucleic 
acids have a universal base located at the 5* or 3' terminus of the nucleic acid. In 
desirable embodiments, one or more (e.g., 2, 3, 4, 5, 6, or more) universal bases are 
located at the 5' and 3' termini of the nucleic acid. Desirably, all of the nucleic acids in 
the population have the same number of universal bases. Desirable universal bases 

20 include inosine, pyrene, 3-nitropyrrole, and 5-nitroindole. 

In desirable embodiments, the nucleic acid has at least one LNA A or LNA T. In 
some embodiments, each nucleic acid has at least one LNA A or LNA T. Desirably, all 
of the adenine and thymine-containing nucleotides in the LNA are LNA A and LNA T, 
respectively. In some embodiments, a nucleic acid with a increased capture efficiency or 

25 melting temperature compared to a control nucleic acid has at least one LNA T or LNA 
C. In some embodiments, all of the thymine and cytosine-containing nucleotides in the 
LNA are LNA T and LNA C, respectively. In some embodiments, a nucleic acid with an 
increased specificity or decreased self-complementarity compared to a control nucleic 
acid has at least one LNA A or LNA C. In some embodiments, all of the adenine and 

30 cytosine-containing nucleotides in the LNA are LNA A and LNA C, respectively. 
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Desirably, at least 10, 20, 25, 30, 40, 50, 60, 70, 80, 90, or 100% of the nucleic acids in 
the population have one ore more LNA units. 

In desirable embodiments, the LNA has at least one 2,6,-diaminopurine, 2- 
aminopurine, 2-thio-thymine, 2-thio-uracil, inosine, or hypoxanthine base. Desirably, the 
5 LNA has a nucleotide with a 2'0, 4*C-methylene linkage between the 2' and 4' position of 
a sugar moiety. In some embodiments, one or more nucleic acids in the first population 
are LNA/DNA, LNA/RNA, or LNA/DNA/RNA chimeras. 

In desirable embodiments of any of the above aspects, the variance in the melting 
temperature of the population is at least 1 0, 20, 30, 40, 50, 60, or 70% less than the 

10 variance in the melting temperature of the corresponding control population of nucleic 
acids of the same length with 2'-deox3mucleotides (e.g., DNA nucleotides) instead of 
LNA units or other modified or non-naturally-occurring units. In desirable embodiments, 
the standard deviation in melting temperature is less than 10, 9.5, 9, 8.5, 8, 7.5, 7, 6.5, or 
6. In certain embodiment, the range in melting temperatures for nucleic acids in the 

15 population is less than 70, 60, 50, 40, 30, or 20^C. Desirably, the variance in the melting 
temperature of the population is less than 59, 50, 40, 30, 25, 20, 15, 10, or 5°C. 

In still other embodiments, the nucleic acids are covalently bonded to a solid 
support. Desirably, the nucleic acids are in a predefined arrangement. In various 
embodiments, the first population has at least 10; 100; 1,000; 5,000; 10,000; 100,000; or 

20 1 ,000,000 different nucleic acids. Desirably, the nucleic acids in the population together 
hybridize to at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, or 100% of the exons of a target 
nucleic acid. In desirable embodiments, the population includes nucleic acids that 
together hybridize to at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, or 100% of the nucleic 
acids expressed by a particular cell or tissue. In some embodiments, the population 

25 includes nucleic acids that together hybridize to at least one exon fi-om at least 1,5, 10, 
20, 25, 30, 40, 50, 60, 70, 80, 90, or 100% of the nucleic acid sequences expressed by a 
particular cell or tissue at a given point in time (e.g., an expression array with sequences 
corresponding to the sequences of mRNA molecules expressed by a particular cell type 
or a cell under a particular set of conditions). In some embodiments, the plurality of 

30 nucleic acids are used as PGR primers or FISH probes. 


14 


Desirable modified bases of the present invention when incorporated into the central 
position of a 9-mer oligonucleotide (all other eight residues or units being natural DNA or RNA 
units with natural bases) exhibit a Tm difference equal to or less than about 15, 12, 10, 9, 8, 7, 6, 
5, 4, 3 or 2^C upon hybridizing to the four complementary oligonucleotide variants that are 
5 identical except for the unit corresponding to the LNA unit, where each variant has one of the 
natural bases uracil, cytosine, thymine, adenine or guanine. That is, the highest and the lowest 
Tm (referred to herein as the T^ differential) obtained with such four complementary sequences 
is 15, 12, 10, 9, 8, 7, 6, 5, 4, 3 or 2°C or less. 

Modified nucleic acid oligomers of the invention desirably contain at least one LNA unit, 

10 such as an LNA unit with a modified nucleobase. Modified nucleobases or nucleosidic bases 
desirably base-pair with adenine, guanine, cytosine, uracil, or thymine. Exemplary oligomers 
contain 2 to 100, 5 to 100, 4 to 50, 5 to 50, 5 to 30, or 8 to 15 nucleic acid units. In some 
embodiments, one or more LNA units with natural nucleobases are incorporated into the 
oligonucleotide at a distance from the LNA unit having a modified base of 1 to 6 (e.g., 1 to 4) 

15 bases. In certain embodiments, at least two LNA units with natural nucleobases are flanking an 
LNA unit having a modified base. Desirably, at least two LNA units independently are 
positioned at a distance from the LNA unit having the modified base of 1 to 6 (e.g., 1 to 4 bases). 

Desirable modified nucleobases or nucleosidic bases for use in nucleic acid compositions 
of the invention include optionally substituted carbon alicyclic or carbocyclic aryl groups (i.e., 

20 only carbon ring members), particularly multi-ring carbocyclic aryl groups such as groups 
having 2, 3, 4, 5, 6, 7, or 8 linked, particularly fused carbocyclic aryl moieties. Optionally 
substituted pyrene is also desirable. Such nucleobases or nucleosidic bases can provide 
significant performance results, as demonstrated in the examples which follow. Heteroalicyclic 
and heteroaromatic nucleobases or nucleosidic bases also are suitable. In some embodiments, 

25 the carbocyclic moiety is linked to the 1 '-position of the LNA unit through a linker (e.g., a 
branched or straight alkylene or alkenylene). 

Desirable LNA units have a carbon or hetero alicyclic ring with four to six ring members, 
e.g. a furanose ring, or other alicyclic ring structures such as a cyclopentyl, cycloheptyl, 
tetrahydropyranyl, oxepanyl, tetrahydrothiophenyl, pyrrolidinyl, thianyl, thiepanyl, piperidinyl, 

30 and the like. In one aspect, at least one ring atom of the carbon or hetero alicyclic group is taken 
to form a further cyclic linkage to thereby provide a multi-cyclic group. The cyclic linkage may 
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include one or more, typically two atoms, of the carbon or hetero alicyclic group. The cyclic 
linkage also may include one or more atoms that are substituents, but not ring members, of the 
carbon or hetero alicyclic group. Other desirable LNA units are compounds having a substituent 
on the 2 '-position of the central sugar moiety (e.g., ribose or xylose), or derivatives thereof, 
5 which favors the C3'-endo conformation, commonly referred to as the North (or simply N for 
short) conformation. These LNA units include EN A (2'-0,4'-C-ethylene-bridged nucleic acids 
such as those disclosed in WO 00/47599) units as well as non-bridged riboses such as 2'-F or 2'- 
0-methyl. 

For any of the above aspects, an exemplary control nucleic acid has P-D-2- 

10 deoxyribose instead of one or more bicyclic or sugar groups of a LNA unit or other 
modified or non-naturally-occurring units in a nucleic acid of the first population. In 
some embodiments, the nucleic acid or population of the invention and the control 
nucleic acid or population only have naturally-occurring nucleobases. If a nucleic acid in 
the nucleic acid or population of the invention has one or more non-naturally-occurring 

1 5 nucleobases, the capture efficiency of the corresponding control nucleic acid is calculated 
as the average capture efficiency for all of the nucleic acids that have either A, T, C, G or 
mC (methyl Cytosin) in each position corresponding to a non-naturally-occurring 
nucleobase in the nucleic acid in the first population. 
Complex of Target Nucleic Acids and Nucleic Acid Probes 

20 In one aspect, the invention features a complex of one or more target nucleic acids 

and nucleic acid of the invention (e.g., nucleic acid probes) in which one or more target 
nucleic acids are hybridized to a plurality of nucleic acids of the invention. Desirably, at 
least 2, 3, 4, 5, 6, 7, 10, 1 5, 20, 30, or 40 different target nucleic acids are hybridized. In 
some embodiments, the target nucleic acids are cDNA molecules reverse transcribed 

25 from a patient sample or cRNA molecules amplified from a patient sample using a T7 
RNA polymerase-based linear amplification system or the like. The target nucleic acids 
are labeled prior to hybridization to the nucleic acids of invention. 
Methods for Detecting or Amplifying Target Nucleic Acids 

In one aspect, the invention features a method for detecting the presence of one or 

30 more target nucleic acids in a sample. This method involves incubating a nucleic acid 
sample with one or more nucleic acids of the invention under conditions that allow at 
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least one target nucleic acid to hybridize to at least one of the nucleic acids of the 
invention. Desirably, hybridization is detected for at least 2, 3, 4, 5, 6, 8, 10, or 12 target 
nucleic acids. In some embodiments, the method further includes contacting the target 
nucleic acid with a second nucleic acid or a population of second nucleic acids that binds 
5 to a different region of the target molecule than the first nucleic acid. Desirably, the 
method further involves identifying one or more hybridized target nucleic acids and/or 
determining the amount of one or more hybridized target nucleic acids. In desirable 
embodiments, the method further includes determining the presence or absence of an 
mRNA splice variant of interest in the sample and/or determining the presence or absence 
10 of a mutation, deletion, and/or duplication of an exon of interest. In some embodiments, 
the mutation, deletion, and/or duplication is indicative of a disease, disorder, or condition, 
such as cancer. 

In desirable embodiments of any of the above detection methods, at least 5, 10, 
15, 20, 30, 40, 50, 80, 100, 150, 200, or more target nucleic acids hybridize to the nucleic 

1 5 acids of the invention. Desirably, the method is repeated under one or more different 

incubation conditions. In particular embodiments, the method is repeated at 1 , 3, 5, 8, 10, 
15, 20, 30, 40 or more different temperatures, cation concentrations (e.g., concentrations 
of monovalent cations such as Na^ and or divalent cations such as Mg'^"^ and Ca^"*^), 
denaturants (e.g., hydrogen bond donors or acceptors that interfere with the hydrogen 

20 bonds keeping the base-pairs together such as formamide or urea). Desirably, the method 
also includes identifying the target nucleic acid hybridized to the nucleic acids of the 
invention and/or determining the amount of the target nucleic acid hybridized to the 
nucleic acids of the invention. In particular embodiments, the target nucleic acids are 
labeled with a fluorescent group. In certain embodiments, the labeling is repeated using 

25 different fluorescent groups (e.g., labelling for so-called dye-swap labeling experiments). 
In desirable embodiments, the determination of the amount of bound target nucleic acid 
involves one or more of the following: (i) adjusting for the varying intensity of the 
excitation light source used for detection of the hybridization, (ii) adjusting for 
photobleaching of the fluorescent group, and/or (iii) comparing the fluorescent intensity 

30 of the target nucleic acid(s) hybridized to the nucleic acids of the invention of nucleic 
acids to the fluorescent intensity of a different sample of nucleic acids hybridized to the 
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nucleic acids of the invention (e.g., a different sample hybridized to the same population 
of nucleic acids of the invention on the same or a different solid support such as the same 
chip or a different chip). Desirably, this comparison in fluorescent intensity involves 
adjusting for a difference in the amount of the nucleic acids of the invention used for 
5 hybridization to each sample and/or adjusting for a difference in the buffer (e.g., a 
difference in Mg^^ concentration) used for hybridization to each sample or scaling for 
different labeling efficiencies with different fluorochromes. 

Desirably, the target nucleic acids are cDNA molecufes reverse transcribed from a 
patient sample or cRNA molecules amplified using a T7 RNA polymerase-based linear 

10 amplification system or the like from a patient sample. In particular embodiments, the 
sample has nucleic acids that are amplified using one or more primers specific for an 
exon of a target nucleic acid, and the method involves determining the presence or 
absence of an mRNA splice variant with the exon in the sample. Desirably, one or more 
of the primers are specific for an exon or exon-exon junction of a pathogen of interest, 

1 5 and the method involves determining the presence or absence of a nucleic acid with the 
exon in the sample. 

In a desirable embodiment, the nucleic acids of the invention are covalently bonded to a 
solid support by reaction of a nucleoside phosphoramidite with an activated solid support, and 
subsequent reaction of a nucleoside phosphoramide with an activated nucleotide or nucleic acid 

20 bound to the solid support. In some embodiments, the solid support or the growing nucleic acid 
bound to the solid support is activated by illumination, a photogenerated acid, or electric current. 

In another aspect, the invention features a method for amplifying a target nucleic acid 
molecule. The method involves (a) incubating a first nucleic acid of the invention with a target 
nucleic acid under conditions that allow the first nucleic acid to bind the target nucleic acid; and 

25 (b) extending the first nucleic acid with the target nucleic acid as a template. Desirably, the 
method further involves contacting the target nucleic acid with a second nucleic acid (e.g., a 
second nucleic acid of the invention) that binds to a different region of the target nucleic acid 
than the first nucleic acid. In various embodiments, the sequence of the target nucleic acid is 
known or unknown. 

30 In one aspect, the invention features a method of detecting a nucleic acid of a pathogen 

(e.g., a nucleic acid in a sample such as a blood or urine sample from a mammal). This method 
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involves contacting a nucleic acid probe of the invention (e.g., a probe specific for an exon or a 
mRNA from a particular pathogen or family of pathogens) with a nucleic acid sample under 
conditions that allow the probe to hybridize to at least one nucleic acid in the sample. The probe 
is desirably at least 60, 70, 80, 90, 95, or 100% complementary to a nucleic acid of a pathogen 
5 (e.g., a bacteria, virus, or yeast such as any of the pathogens described herein). Hybridization 
between the probe and a nucleic acid in the sample is detected, indicating that the sample 
contains the corresponding nucleic acid from a pathogen. In some embodiments, the method is 
used to determine what strain of a pathogen has infected a mammal (e.g., a human) by 
determining whether a particular nucleic acid is present in the sample. In other embodiments, 

10 the probe has a universal base in a position corresponding to a nucleotide that varies among 

different strains of a pathogen, and thus the probe detects the presence of a nucleic acid from any 

of a multiple of pathogenic strains. 

Methods for Classifying Nucleic Acids Samples 

In one aspect, the invention features a method for classifying a test nucleic acid 

1 5 sample including target nucleic acids. This method involves (a) incubating a test nucleic 
acid sample with a one or more nucleic acids of the invention under conditions that allow 
at least one of the nucleic acids in the test sample to hybridize to at least one nucleic acid 
of the invention, (b) detecting a hybridization pattern of the test nucleic acid sample, and 
(c) comparing the hybridization pattern to a hybridization pattern of a first nucleic acid 

20 standard, whereby the comparison indicates whether or not the test sample has the same 
classification as the first standard. Desirably, the method also includes comparing a 
hybridization pattern of the test nucleic acid sample to a hybridization pattern of a second 
standard. In various embodiments, a hybridization pattern of the test nucleic acid sample 
is compared to at least 3, 4, 5, 8, 10, 15, 20, 30, 40, or more standards. 

25 Desirably, the method also includes identifying the hybridized target nucleic acid 

and/or determining the amount of hybridized target nucleic acid. In particular 
embodiments, the target nucleic acids are labeled with a fluorescent group. Desirably, 
the first nucleic acid standard is labeled with a different fluorescent group. The 
fluorescence of the target nucleic acids and the first nucleic acid standard can be detected 

30 simultaneously or sequentially. 


19 


In desirable embodiments, the method further includes determining the presence 
or absence of an mRNA splice variant of interest in the sample and/or determining the 
presence or absence of a mutation, deletion, and/or duplication of an exon of interest. In 
some embodiments, the mutation, deletion, and/or duplication is indicative of a disease, 
5 disorder, or condition, such as cancer. 

In desirable embodiments, the determination of the amount of bound target 
nucleic acid involves one or more of the following: (i) adjusting for the varying intensity 
of the excitation light source used for detection of the hybridization, (ii) adjusting for 
photobleaching of the fluorescent group, and/or (iii) comparing the fluorescent intensity 

10 of the target nucleic acid(s) hybridized to the nucleic acids of the invention to the 

fluorescent intensity of a different sample of nucleic acids hybridized to the nucleic acids 
of the invention (e.g., a different sample hybridized to same set of nucleic acids of the 
invention on the same or a different solid support such as the same chip or a different 
chip). Desirably, this comparison in fluorescent intensity involves adjusting for a 

1 5 difference in the amount of the plurality used for hybridization to each sample and/or 
adjusting for a difference in the buffer (e.g., a difference in Mg^^ concentration) used for 
hybridization to each sample. 

Desirably, the nucleic acids in the population together hybridize to at least 10, 20, 
30, 40, 50, 60, 70, 80, 90, 95, or 100% of the exons of a target nucleic acid. In desirable 

20 embodiments, the population includes nucleic acids that together hybridize to at least 10, 
20, 30, 40, 50, 60, 70, 80, 90, 95, or 100% of the nucleic acids expressed by a particular 
cell or tissue. In some embodiments, the population includes nucleic acids that together 
hybridize to at least one exon from at least 1, 5, 10, 20, 25, 30, 40, 50, 60, 70, 80, 90, or 
100% of the nucleic acid sequences expressed by a particular cell or tissue at a given 

25 point in time (e.g., an expression array with sequences corresponding to the sequences of 
mRNA molecules expressed by a particular cell type or a cell under a particular set of 
conditions). Desirably, the method further includes using a nucleic acid or a region of a 
nucleic acid that is present in a first test sample but not present in a first standard or not 
present in a second test sample as a probe or primer for the detection, amplification, or 

30 characterization of the nucleic acid. 
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In desirable embodiments of any of the above methods, at least 5, 10, 15, 20, 30, 
40, 50, 80, 100, 150, 200, or more target nucleic acids hybridize to the nucleic acids of 
the invention. Desirably, the method is repeated under one or more different incubation 
or hybridization conditions, hi particular embodiments, the method is repeated at 1, 3, 5, 
5 8, 10, 15, 20, 30, 40 or more different temperatures, cation concentrations (e.g., 

concentration of monovalent cations such as Na^ and K"*^ or divalent cations such as Mg^^ 
and Ca ^),' denaturants (e.g., hydrogen bond donors or acceptors that interfere with the 
hydrogen bonds keeping the base-pairs together such as formamide or urea). 

In particular embodiments, the sample has nucleic acids that are amplified using 

10 one or more primers specific for an exon of a target nucleic acid, and the method involves 
determining the presence or absence of an mRNA splice variant with the exon in the 
sample. Desirably, one or more of the primers are specific for an exon or exon-exon 
junction of a pathogen of interest, and the method involves determining the presence or 
absence of a nucleic acid with the exon in the sample. 

15 Desirably, the comparison of the hybridization pattern of a patient nucleic acid 

sample to that of one or more standards is used to determine whether or not a patient has 
a particular disease, disorder, condition, or infection or an increased risk for a particular 
disease, disorder, condition, or infection. In some embodiments, the comparison is used 
to determine what pathogen has infected a patient and to select a therapeutic for the 

20 treatment of the patient. Desirably, the comparison is used to select a therapeutic for the 
treatment or prevention of a disease or disorder in the patient. In yet other embodiments, 
the comparison is used to include or exclude the patient from a group in a clinical trial. 
Desirably, the comparison is used to compare the expression of nucleic acids (e.g., 
mRNA splice forms associated with toxicity) in the presence and absence of a candidate 

25 compound (e.g., a lead compound for drug development). In other embodiments, the 
comparison is used to determine differences in expression of nucleic acids (e.g., mRNA 
splice variants) under particular conditions (e.g., under different environmental stress 
conditions) or at different developmental time points. In particular embodiments, the 
expression of one or more members from a particular enzyme class (e.g., protein kinase 

30 splice variants) is measured. 
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In a desirable embodiment, the nucleic acids of the invention are covalently bonded to a 
solid support by reaction of a nucleoside phosphoramidite with an activated solid support, and 
subsequent reaction of a nucleoside phosphoramide with an activated nucleotide or nucleic acid 
bound to the solid support. In some embodiments, the solid support or the growing nucleic acid 
5 bound to the solid support is activated by illumination, a photogenerated acid, or electric current. 

The use of a variety of different monomers in the nucleic acids of the invention offers a 
means to "fine tune" the chemical, physical, biological, pharmacokinetic, and pharmacological 
properties of the nucleic acids thereby facilitating improvement in their safety and efficacy 
profiles when used as a therapeutic drug. 
1 0 Applications for the Nucleic Acids of the Invention 

In another aspect, the invention features the use of one ore more nucleic acids of 
the invention for the detection, amplification, or classification of a nucleic acid of interest 
or a population of nucleic acids of interest. 

In another aspect, the invention features the use of one or more nucleic acids of 
15 the invention for alternative mRNA splice variant detection, expression profiling, 
comparative genomic hybridization, or real-time PGR. In exemplary real-time PGR 
applications, the nucleic acids are used to determine the amount of one or more target 
nucleic acids (e.g., mRNA splice variants) in a sample. In particular embodiments, 
fluorescently labeled RT-PCR products from the amplification of a test nucleic acid 
20 sample are hybridized to a population of nucleic acids of the invention. Desirably, the 
amount of one or more RT-PGR products is measured to determine the amount of the 
corresponding nucleic acid in the initial sample. 

In yet another aspect, the invention features the use of a nucleic of the invention as a PGR 
primer or FISH probe. 
25 Methods for Selecting a Population of Nucleic Acid 

In one aspect, the invention features a method of selecting a nucleic acid for a 
population of nucleic acids. This method involves (a) determining the melting 
temperature of a nucleic acid of the invention, determining the ability of the nucleic acid 
to self-anneal, determining the ability of the nucleic acid to hybridize to one or more 
30 exons or introns of a target nucleic acid, and/or determining the ability of the nucleic acid 
to hybridize to a non-target nucleic acid, and (b) selecting the nucleic acid for inclusion 
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or exclusion from the population based on the determination in step (a). In desirable 
embodiments, step (a) is performed for at least 2, 3, 4, 5, 6, 10, 20, 50, 100, 200, 500, 
1,000, 5,000 or more nucleic acids, and a subset of the nucleic acids are selected for 
inclusion in the population based on the determination in step (a). Desirably, the nucleic 
5 acids with the highest melting temperatures and/or ability to hybridize to one or more 
exons or introns of a target nucleic acid are selected. Desirably, the nucleic acids with 
the lowest ability to self-anneal and/or hybridize to a non-target nucleic acid are selected. 
Databases with Hybridization Patterns of Nucleic Acids Samples and/or Standards 

The invention also features a variety of databases. These databases are useful for storing 

10 the information obtained in any of the methods of the invention. These databases may also be 
used in the diagnosis of disease or an increased risk for a disease or in the selection of a desirable 
therapeutic for a particular patient or class of patients. 

Accordingly, in one such aspect, the invention provides an electronic database including 
at least 1, 10, 10^ 10^ 5 x lo\ 10^ 10^ 10^ 10^ 10^ or 10^ records of a nucleic acid of interest 

15 or a population of nucleic acids of interest (e.g., one or more nucleic acids in a standard or in a 
test nucleic acid sample) correlated to records of its hybridization pattern to a plurality of nucleic 
acids of the invention under one or more incubation conditions (e.g., one or more temperatures, 
denaturant concentrations, or salt concentrations). 

In another aspect, the invention features computer including the database of the above 

20 aspect and a user interface (i) capable of displaying a hybridization pattem for a nucleic acid of 
interest or a population of nucleic acids of interest whose record is stored in the computer or (ii) 
capable of displaying a nucleic acid of interest (e.g., displaying the polynucleotide sequence or 
another identifying characteristic of the nucleic acid of interest) or a population of nucleic acids 
of interest that produces a hybridization pattem whose record is stored in the computer. 

25 Methods for Silencing a Target Nucleic Acid in a Cell or Animal 

One method for inhibiting specific gene expression involves the use of antisense or 
double stranded oligonucleotides, which are complementary to a specific target messenger RNA 
(mRNA) sequence, such as a specific mRNA splice variant. Of special interest are 
oligonucleotides with a modified backbone (such as LNA or phosphorothioate) that are not 

30 readily degraded by endonucleases in the target cells. 
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In one aspect, the invention features the use of a nucleic acid of the invention for the 
manufacture of a pharmaceutical composition for treatment of a disease curable by an antisense 
or RNAi technology. 

In one aspect, the invention provides a method for inhibiting the expression of a target 
5 nucleic acid in a cell. The method involves introducing into the cell a nucleic acid of the 

invention in an amount sufficient to specifically attenuate expression of the target nucleic acid. 
The introduced nucleic acid has a nucleotide sequence that is essentially complementary to a 
region of desirably at least 20 nucleotides of the target nucleic acid. Desirably, the cell is in a 
manmial. 

10 In a related aspect, the invention provides a method for preventing, stabilizing, or treating 

a disease, disorder, or condition associated with a target nucleic acid in a mammal. This method 
involves introducing into the mammal a nucleic acid of the invention in an amount sufficient to 
specifically attenuate expression of the target nucleic acid, wherein the introduced nucleic acid 
has a nucleotide sequence that is essentially complementary to a region of desirably at least 20 

1 5 nucleotides of the target nucleic acid. 

In another aspect, the invention provides a method for preventing, stabilizing, or treating 
a pathogenic infection in a mammal by introducing into the mammal a nucleic acid of the 
invention in an amount sufficient to specifically attenuate expression of a target nucleic acid of a 
pathogen. The introduced nucleic acid has a nucleotide sequence that is essentially 

20 complementary to a region of desirably at least 20 nucleotides of the target nucleic acid. 

In desirable embodiments of the therapeutic methods of the above aspects, the mammal is 
a human. In some embodiments, the introduced nucleic acid is single stranded or double 
stranded. 

With respect to the therapeutic methods of the invention, it is not intended that the 
25 administration of nucleic acids to a mammal be limited to a particular mode of administration, 
dosage, or frequency of dosing; the present invention contemplates all modes of administration, 
including oral, intraperitoneal, intramuscular, intravenous, intraarticular, intralesional, 
subcutaneous, or any other route sufficient to provide a dose adequate to prevent or treat a 
disease (e.g., a disease associated with the expression of a target nucleic acid that is silenced with 
30 a nucleic acid of the invention). One or more nucleic acids may be administered to the mammal 
in a single dose or multiple doses. When multiple doses are administered, the doses may be 
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separated from one another by, for example, one week, one month, one year, or ten years. It is to 
be understood that, for any particular subject, specific dosage regimes should be adjusted over 
time according to the individual need and the professional judgment of the person administering 
or supervising the administration of the compositions. 
5 Exemplary mammals that can be treated using the methods of the invention include 

humans, primates such as monkeys, animals of veterinary interest (e.g., cows, sheep, goats, 
buffalos, and horses), and domestic pets (e.g., dogs and cats). Exemplary cells in which one or 
more target genes can be silenced using the methods of the invention include invertebrate, plant, 
bacteria, yeast, and vertebrate (e.g., mammalian or human) cells. 

10 Optimum dosages for gene silencing applications may vary depending on the relative 

potency of individual oligonucleotides, and can generally be estimated based on EC50 values 
found to be effective in in vitro and in vivo animal models. In general, dosage is fi-om 0.001 |ig 
to 100 g per kg of body weight (e.g., 0.001 |ig/kg to 1 g/kg), and may be given once or more 
daily, weekly, monthly or yearly, or even once every 2 to 20 years (U.S.P.N. 6,440,739). 

1 5 Persons of ordinary skill in the art can easily estimate repetition rates for dosing based on 

measured residence times and concentrations of the drug in bodily fluids or tissues. Following 
successful treatment, it may be desirable to have the patient undergo maintenance therapy to 
prevent the recurrence of the disease state, wherein the oligonucleotide is administered in 
maintenance doses, ranging from 0.001 ug to 100 g per kg of body weight (e.g., 0.001 |ig/kg to 1 

20 g/kg), once or more daily, to once every 20 years. If desired, conventional treatments may be 
used in combination with the nucleic acids of the present invention. 

Suitable carriers include, but are not limited to, saline, buffered saline, dextrose, water, 
glycerol, ethanol, and combinations thereof The composition can be adapted for the mode of 
administration and can be in the form of, for example, a pill, tablet, capsule, spray, powder, or 

25 liquid. In some embodiments, the pharmaceutical composition contains one or more 

pharmaceutically acceptable additives suitable for the selected route and mode of administration. 
These compositions may be administered by, without limitation, any parenteral route including 
intravenous, intra-arterial, intramuscular, subcutaneous, intradermal, intraperitoneal, intrathecal, 
as well as topically, orally, and by mucosal routes of delivery such as intranasal, inhalation, 

30 rectal, vaginal, buccal, and sublingual. In some embodiments, the pharmaceutical compositions 
of the invention are prepared for administration to vertebrate (e.g., mammalian) subjects in the 
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form of liquids, including sterile, non-pyrogenic liquids for injection, emulsions, powders, 
aerosols, tablets, capsules, enteric coated tablets, or suppositories. 
Exemplary Oligomers of the Invention and Methods for Synthesizing Them 

In desirable embodiments, the invention features a method of synthesizing a nucleic acid. 
5 This method involves synthesizing a 2-thio-uridine nucleoside or nucleotide of formula IV using 
a compound of formula VIII, IX, X, XI, or XII as shown below. The nucleoside, nucleoside 
phosphoramidite, or nucleotide is incorporated into a nucleic acid of the invention. 


o 



10 In a particular embodiment, nucleobase thiolation is performed on the 02 position 

of compound XI to form compound IV. In another embodiment, sulphurization on both 
02 and 04 in compound VIII generates a 2,4-dithio-uridine nucleoside or nucleotide of 
formula X which is converted into compound IV. In yet another embodiment, a cyclic 
ether of formula XI is transferred into compound IV or a 2-0-alkyl-uridine nucleoside or 

1 5 nucleotide of formula XII through reaction with the 5' position. In other embodiments. 
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a 2-0-alkyl-uridine nucleoside or nucleotide of formula XII is generated by direct 
alkylation of a uridine nucleoside or nucleotide of formula VIII. 

In desirable embodiments R"^ and are each independently alkyl (e.g., methyl or ethyl), 
acyl (e.g., acetyl or benzoyl), or any appropriate protecting group such as silyl, 4,4'- 
5 dimethoxytrityl, monomethoxytrityl, or trityl(triphenylmethyl). R^ is any appropriate protecting 
group such as silyl, 4,4'-dimethoxytrityl, monomethoxytrityl, trityl(triphenylmethyl), acetyl, 
benzoyl, or benzyl. In desirable embodiments, R^ is hydrogen, alkyl (e.g., methyl or ethyl), 1- 
propynyl, thiazol-2-yl, pyridine-2-yl, thien-2-yl, imidazol-2-yl, (4/5-methyl)-thiazol-2-yl, 3- 
(iodoacetamido)propyl, 4-[//,A^-bis(3-aminopropyl)amino]butyl), or halo (e.g., chloro, bromo, 
10 iodo, fluoro). 

The group -OR^ in the formulas IV, VIII, IX, X, XI, and XII is any of the groups listed 
for R^ or R^ in formula la or formula lb or listed for R^ or R^* in formula Ila, Scheme A, or 
Scheme B, or the group -OR^ orR^' in the formulas IV, VIII, IX, X, XI, and XII is selected from 
the group consisting of H, -OH, P(0(CH2)2CN)N(iPr)2, P(0(CH2)2CN)N(iPr)2, phosphate, 

1 5 phosphorothioate, phosphorodithioate, phosphoramidate, phosphoroselenoate, 

phosphorodiselenoate, alkylphosphotriester, methyl phosphonate, halo (e.g., chloro, fluoro, iodo, 
or bromo), optionally substituted aryl, (eg., phenyl or benzyl), alkyl (eg, methyl or ethyl), 
alkoxy (eg., methoxy), acyl (eg., acetyl or benzoyl), aroyl, aralkyl, hydroxy, hydroxyalkyl, 
alkoxy, aryloxy, aralkoxy, nitro, cyano, carboxy, alkoxycarbonyl, aryloxycarbonyl, 

20 aralkoxycarbonyl, acylamino, aroylamine, alkylsulfonyl, arylsulfonyl, heteroarylsulfonyl, 
alkylsulfinyl, arylsulfinyl, heteroarylsulfmyl, alkylthio, arylthio, heteroarylthio, aralkylthio, 
heteroaralkylthio,amidino, amino, carbamoyl, sulfamoyl, alkene, alkyne, protecting groups (e.g., 
silyl, 4,4'-dimethoxytrityl, monomethoxytrityl, or trityl(triphenylmethyl)), linkers (e.g., a linker 
containing an amine, ethylene glycol, quinone such as anthraquinone), detectable labels (e.g., 

25 radiolabels or fluorescent labels), and biotin. 

The group -OR^ in the formulas IV, and VIII, IX, X, and XII is any of the groups listed 
for R^ or R^ in formula la or formula lb or listed for R^ or R^* in formula Ila, Scheme A, or 
Scheme B, or the group -OR^ orR^ in the formulas IV, and VIII, IX, X, and XII is selected 
from the group consisting of H, -OH, P(0(CH2)2CN)N(iPr)2, P(0(CH2)2CN)N(iPr)2, phosphate, 

30 phosphorothioate, phosphorodithioate, phosphoramidate, phosphoroselenoate, 

phosphorodiselenoate, alkylphosphotriester, methyl phosphonate, halo (eg., chloro, fluoro, iodo. 


27 


or bromo), optionally substituted aryl, (e.g., phenyl or benzyl), alkyl {e.g, methyl or ethyl), 
alkoxy (e.g., methoxy), acyl (e.g. acetyl or benzoyl), aroyl, aralkyl, hydroxy, hydroxyalkyl, 
alkoxy, aryloxy, aralkoxy, nitro, cyano, carboxy, alkoxycarbonyl, aryloxycarbonyl, 
aralkoxycarbonyl, acylamino, aroylamine, alkylsulfonyl, arylsulfonyl, heteroarylsulfonyl, 
alkylsulfinyl, arylsulfinyl, heteroarylsulfinyl, alkylthio, arylthio, heteroarylthio, aralkylthio, 
heteroaralkylthio,amidino, amino, carbamoyl, sulfamoyl, alkene, alkyne, protecting groups (e.g., 
silyl, 4,4*-dimethoxytrityl, monomethoxytrityl, or trityl(triphenylmethyl)), linkers (e.g., a linker 
containing an amine, ethylene glycol, quinone such as anthraquinone), detectable labels (e.g., 
radiolabels or fluorescent labels), and biotin. 

In yet another aspect, the invention features a method of synthesizing a nucleic acid. 
This method involves synthesizing a 2-thiopyrimidine nucleoside or nucleotide of formula IV 
using a compound of formula III or compounds of the formula I, II, and III as shown below. 
The nucleoside, nucleoside phosphoramidite, or nucleotide is incorporated into a nucleic acid of 
the invention. 


O 



IV 

In some embodiments, lewis acid-catalyzed condensation of a substituted sugar of 
formula I and a substituted 2-thio-uracil of formula II results in a substituted 2-thio-uridine 
nucleoside or nucleotide of the formula III. In some embodiments, a compound of formula III is 
converted into a LNA 2-thiouridine nucleoside or nucleotide of formula IV. 
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In desirable embodiments R"* and are, e.g., methanesulfonyloxy, p- 
toluenesulfonyloxy, or any appropriate protecting group such as silyl, 4,4'-dimethoxytrityl, 
monomethoxytrityl, trityl(triphenylmethyl), acetyl, benzoyl, or benzyl, R^ is, e.g., acetyl, 
benzoyl, alkoxy (e.g., methoxy). R^ is, e.g.,acetyl or benzoyl, and R^ is any appropriate 
5 protecting group such as silyl, 4,4 -dimethoxytrityl, monomethoxytrityl, trityl(triphenylmethyl), 
acetyl, or benzoyl. In desirable embodiments, R^ is hydrogen, alkyl (e.g. methyl or ethyl), 1- 
propynyl, thiazol-2-yl, pyridine-2-yl, thien-2-yl, imidazol-2-yl, (4/5-methyl)-thiazol-2-yl, 3- 
(iodoacetamido)propyl, 4-[A^//-bis(3-aminopropyl)amino]butyl), or halo (e.g., chloro, bromo, 
iodo, fluoro). 

10 The group -OR"^ in the formulas I, III, and IV is any of the groups listed for R^ or R^ in 

formula la or formula lb or listed for R^ or R^* in formula Ila, Scheme A, or Scheme B, or the 
group -OR^ orR^ in the formulas!, Ill, and IV is selected from the group consisting of H, -OH, 
P(0(CH2)2CN)N(iPr)2, phosphate, phosphorothioate, phosphorodithioate, phosphoramidate, 
phosphoroselenoate, phosphorodiselenoate, alkylphosphotriester, methyl phosphonate, halo (e.g., 

15 chloro, fluoro, iodo, or bromo), optionally substituted aryl, (e.g., phenyl or benzyl), alkyl (e.g. 
methyl or ethyl), alkoxy (e.g., methoxy), acyl (e.g. acetyl or benzoyl), aroyl, aralkyl, hydroxy, 
hydroxyalkyl, alkoxy, aryloxy, aralkoxy, nitro, cyano, carboxy, alkoxycarbonyl, 
aryloxycarbonyl, aralkoxycarbonyl, acylamino, aroylamine, alkylsulfonyl, arylsulfonyl, 
heteroarylsulfonyl, alkylsulfinyl, arylsulfinyl, heteroarylsulfinyl, alkylthio, arylthio, 

20 heteroarylthio, aralkylthio, heteroaralkylthio,amidino, amino, carbamoyl, sulfamoyl, alkene, 
alkyne, protecting groups (e.g., silyl, 4,4*-dimethoxytrityl, monomethoxytrityl, or 
trityl(triphenylmethyl)), linkers (e.g., a linker containing an amine, ethylene glycol, quinone such 
as anthraquinone), detectable labels (e.g., radiolabels or fluorescent labels), and biotin. 

The group R^ in the formulas I, III, and IV is any of the groups listed for R^ or R^ in 

25 formula la or formula lb or listed for R^ or R^* in formula Ila, Scheme A, or Scheme B, or R^ in 
the formulas I, III, and IV is selected from the group consisting of H, -OH, 
P(0(CH2)2CN)N(iPr)2, phosphate, phosphorothioate, phosphorodithioate, phosphoramidate, 
phosphoroselenoate, phosphorodiselenoate, alkylphosphotriester, methyl phosphonate, halo (e.g., 
chloro, fluoro, iodo, or bromo), optionally substituted aryl, (e.g., phenyl or benzyl), alkyl (e.g, 

30 methyl or ethyl), alkoxy (e.g., methoxy), acyl (e.g. acetyl or benzoyl), aroyl, aralkyl, hydroxy, 
hydroxyalkyl, alkoxy, aryloxy, aralkoxy, nitro, cyano, carboxy, alkoxycarbonyl. 
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aryloxycarbonyl, aralkoxycarbonyl, acylamino, aroylamine, alkylsulfonyl, arylsulfonyl, 
heteroarylsulfonyl, alkylsulfinyl, arylsulfinyl, heteroarylsulfinyl, alkylthio, arylthio, 
heteroarylthio, aralkylthio, heteroaralkylthio,amidino, amino, carbamoyl, sulfamoyl, alkene, 
alkyne, protecting groups (e.g., silyl, 4,4'-'dimethoxytrityl, monomethoxytrityl, or 
5 trityl(triphenylmethyl)), linkers (e.g., a linker containing an amine, ethylene glycol, quinone such 
as anthraquinone), detectable labels (e.g., radiolabels or fluorescent labels), and biotin. 

In still another aspect, the invention features a method of synthesizing a nucleic acid. 
This method involves synthesizing a 2-thiopyrimidine nucleoside or nucleotide of formula IV 
using a compound of formula VII, compounds of the formula V, VI, and VII, or compounds of 
10 the formula I, V, VI, and VII as shown below. The nucleoside, nucleoside phosphoramidite, or 
nucleotide is incorporated into a nucleic acid of the invention. 



VII 


In some embodiments, a 2-thio-uridine nucleoside or nucleotide of the formula IV is 
1 5 synthesized through ring- synthesis of the nucleobase by reaction of an amino sugar of the 
formula V and a substituted isothiocyanate of the formula VI. 

In desirable embodiments, R"* and are each idenpendently, e.g., methanesulfonyloxy, 
p-toluenesulfonyloxy, or any appropriate protecting group such as silyl, 4,4*-dimethoxytrityl, 
monomethoxytrityl, trityl(triphenylmethyl), acetyl, benzoyl, or benzyl. R^ is, e.g., acetyl or 
20 benzoyl or alkoxy (e.g., methoxy), and R^ is, e.g., acetyl or benzoyl, R^ is any appropriate 
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protecting group such as silyl, 4,4'-dimethoxytrityl, monomethoxytrityl, trityl(triphenylmethyl), 
acetyl, or benzoyl. are R^each idenpendently, e.g., hydrogen or alkyl (e.g. methyl or ethyl). 
R^can also be, e.g., an appropriate protecting group such as silyl, 4,4*-dimethoxytrityl, 
monomethoxytrityl, or trityl(triphenylmethyl). In desirable embodiments, R^ is hydrogen or 
5 methyl, and R^ is methyl or ethyl. 

The group -OR^ in the formulas I, V, VII, and IV is any of the groups listed for R^ or R^ 
in formula la or formula lb or listed for R^ or R^* in formula Ila, Scheme A, or Scheme B, or the 
group -OR^ or R^' in the formulas I, V, VII, and IV is selected from the group consisting of H, r 
OH, P(0(CH2)2CN)N(iPr)2, phosphate, phosphorothioate, phosphorodithioate, phosphoramidate, 

10 phosphoroselenoate, phosphorodiselenoate, alkylphosphotriester, methyl phosphonate, halo (e g,, 
chloro, fluoro, iodo, or bromo), optionally substituted aryl, (e.g., phenyl or benzyl), alkyl (e.g, 
methyl or ethyl), alkoxy (e.g., methoxy), acyl (eg. acetyl or benzoyl), aroyl, aralkyl, hydroxy, 
hydroxyalkyl, alkoxy, aryloxy, aralkoxy, nitro, cyano, carboxy, alkoxycarbonyl, 
aryloxycarbonyl, aralkoxycarbonyl, acylamino, aroylamine, alkylsulfonyl, arylsulfonyl, 

15 heteroarylsulfonyl, alkylsulfinyl, arylsulfinyl, heteroarylsulfinyl, alkylthio, arylthio, 

heteroarylthio, aralkylthio, heteroaralkylthio,amidino, amino, carbamoyl, sulfamoyl, alkene, 
alkyne, protecting groups (e.g., silyl, 4,4'-dimethoxytrityl, monomethoxytrityl, or 
trityl(triphenylmethyl)), linkers (e.g., a linker containing an amine, ethylene glycol, quinone such 
as anthraquinone), detectable labels (e.g., radiolabels or fluorescent labels), and biotin. 

20 R^' in the formulas I, V, VII, and IV is any of the groups listed for R^ or R^ in formula la 

or formula lb or listed for R^ or R^* in formula Ila, Scheme A, or Scheme B, or R^ in the 
formulas I, V, VII, and IV is selected from the group consisting of H, -OH, 
P(0(CH2)2CN)N(iPr)2, phosphate, phosphorothioate, phosphorodithioate, phosphoramidate, 
phosphoroselenoate, phosphorodiselenoate, alkylphosphotriester, methyl phosphonate, halo (e.g., 

25 chloro, fluoro, iodo, or bromo), optionally substituted aryl, (e.g., phenyl or benzyl), alkyl (e.g, 
methyl or ethyl), alkoxy (e.g., methoxy), acyl (e.g. acetyl or benzoyl), aroyl, aralkyl, hydroxy, 
hydroxyalkyl, alkoxy, aryloxy, aralkoxy, nitro, cyano, carboxy, alkoxycarbonyl, 
aryloxycarbonyl, aralkoxycarbonyl, acylamino, aroylamine, alkylsulfonyl, arylsulfonyl, 
heteroarylsulfonyl, alkylsulfinyl, arylsulfinyl, heteroarylsulfinyl, alkylthio, arylthio, 

30 heteroarylthio, aralkylthio, heteroaralkylthio,amidino, amino, carbamoyl, sulfamoyl, alkene, 
alkyne, protecting groups (e.g., silyl, 4,4'-dimethoxytrityl, monomethoxytrityl, or 
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trityl(triphenylmethyl)), linkers (e.g., a linker containing an amine, ethylene glycol, quinone such 
as anthraquinone), detectable labels (e.g., radiolabels or fluorescent labels), and biotin. 

In another aspect, the invention features a method of synthesizing a nucleic acid. This 
method involves synthesizing a 2-thiopyrimidine nucleoside as shown below. In desirable 
5 embodiments, the method further comprises reacting one or both compounds of the formula 4 
with a phosphodiamidite (e.g., 2-cyanoethyl tetraisopropylphosphorodiamidite) to produce the 
corresponding nucleoside phosphoramidite. The nucleoside, nucleoside phosphoramidite, or 
nucleotide is incorporated into a nucleic acid of the invention. 



R30- 


10 

In some embodiments, a glycosyl-donor is coupled to a nucleobase as shown in pathway 
A. In other embodiments, ring synthesis of the nucleobase is performed as show in pathway B. 
In still other embodiments, LNA-T diol is modified as shown in pathway C. 

In desirable embodiments, R is hydrogen, methyl, 1-propynyl, thiazol-2-yl, pyridine-2- 
15 yl, thien-2-yl, imidazol-2-yl, (4/5-methyl)-thiazol-2-yl, 3-(iodoacetamido)propyl, 4-[//,//-bis(3- 
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aminopropyl)amino]butyl, or halo (e.g., chloro, bromo, iodo, fluoro). Desirably, Ri, R2, and R3 
are each any appropriate protecting group such as acetyl, benzyl, silyl, 4,4'-dimethoxytrityl, 
monomethoxytrityl, or trityl(triphenylmethyl). 

In another aspect, the invention features a method of synthesizing a nucleic acid. This 
5 method involves synthesizing a 2-thiopyrimidine nucleoside or nucleotide of formula 4 using a 
compound of formula 3, compounds of the formula 2 and 3, or compounds of the formula 1, 2, 3, 
and 4 as shown below. The nucleoside, nucleoside phosphoramidite, or nucleotide is 
incorporated into a nucleic acid of the invention. This method can also be performed using any 
other appropriate protecting groups instead of Bn (benzyl), Ac (acetyl), or Ms (methansulfonyl). 



In desirable embodiments, the method further comprises reacting one or both compounds 
of the formula 4 with a phosphodiamidite (e.g., 2-cyanoethyl tetraisopropylphosphorodiamidite) 
to produce the corresponding nucleoside phosphoramidite. 

In another aspect, the invention features a method of synthesizing a nucleic acid. This 
15 method involves synthesizing a nucleoside or nucleotide of formula 10 or 11 using a compound 
of any one of the formula 6-9, compounds of the formula 5 and any one of the formulas 6-9, or 
compounds of the formula 4, 5, and any one of the formulas 6-9 as shown below. The 
nucleoside, nucleoside phosphoramidite, or nucleotide is incorporated into a nucleic acid of the 
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invention. This method can also be performed using any other appropriate protecting groups 
instead of DMT, Bn, Ac, or Ms. 

0 

MsO-> MsO-. N^N^ 


MsO 


OBn OAc OBn OAc 

4 5 


b 


BnO "° 

CI 6: R= Ms _ I — 10: r,= h 


I — o: K= MS I — 10: R,= 
L=7.R=Bz ^L^11:r,= 


8: R= H 


P(0(CH2)2CN)N(iPr)2 


9: R = DMT 


5 In some embodiments, a compound of formula 4 is used as a glycosyl donor in a coupling 

reaction with silylated hypoxantine to form a compound of the formula 5. In certain 
embodiments, a compound of the formula 5 is used in a ring closing reaction to forma compound 
of the formula 6. Desirably, deprotection of the 5 -hydroxy group of compound 6 is performed 
by displacing the 5'-0-mesyl group with sodium benzoate to produce a compound of the formula 

10 7 that is converted into a compound of the formula 8 after saponification of the 5*-benzoate. In 
some embodiments, compound 8 is converted to a DMT-protected compound 9 prior to 
debenzylation of the 3*-(9-hydroxy group. In desirable embodiments, a phosphoramidite of the 
formula 11 is generated by phosphitylationof a nucleoside of the formula 10. 

In desirable embodiments, the Ri is H or P(0(CH2)2CN)N(iPr)2. In other embodiments, 

1 5 the group Ri or -OR] is any of the groups listed for R^ or R^ in formula la or formula lb or listed 
for R^ or R^* in formula Ila, Scheme A, or Scheme B, or the group -ORi orRi is selected from the 
group consisting of-OH, P(0(CH2)2CN)N(iPr)2, phosphate, phosphorothioate, 
phosphorodithioate, phosphoramidate, phosphoroselenoate, phosphorodiselenoate, 
alkylphosphotriester, methyl phosphonate, halo (eg., chloro, fluoro, iodo, or bromo), optionally 

20 substituted aryl, (e.g., phenyl or benzyl), alkyl (e.g, methyl or ethyl), alkoxy (e.g., methoxy), acyl 
(e.g. acetyl or benzoyl), aroyl, aralkyl, hydroxy, hydroxyalkyl, alkoxy, aryloxy, aralkoxy, nitro, 
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cyano, carboxy, alkoxycarbonyl, aryloxycarbonyl, aralkoxycarbonyl, acylamino, aroylamine, 
alkylsulfonyl, arylsulfonyl, heteroarylsulfonyl, alkylsulfinyl, arylsulfinyl, heteroarylsulfinyl, 
alkylthio, arylthio, heteroarylthio, aralkylthio, heteroaralkylthio,amidino, amino, carbamoyl, 
sulfamoyl, alkene, alkyne, protecting groups (e.g., silyl, 4,4'-dimethoxytrityl, 
5 monomethoxytrityl, or trityl(triphenylmethyl)), linkers (e.g., a linker containing an amine, 
ethylene glycol, quinone such as anthraquinone), detectable labels (e.g., radiolabels or 
fluorescent labels), and biotin. 

In another aspect, the invention features a method of synthesizing a nucleic acid. This 
method involves synthesizing a nucleoside or nucleotide of formula 20 or 21 as shown below, in 
10 which compound 4 is the same sugar shown in the above aspect. The nucleoside, nucleoside 

phosphoramidite, or nucleotide is incorporated into a nucleic acid of the invention. This method 
can also be performed using any other appropriate protecting groups instead of DMT, Bn, Bz 
(benzoyl), Ac, or Ms. Additionally, the method can be performed with any other halogen (e.g., 
fluoro or bromo) instead of chloro. 
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,P-0(CH2)2CN 

(Pr')2N 


In desirable embodiments to promote the ring closing reaction, a solution of compound 
14 in aqueous 1 ,4-dioxane is treated with sodium hydroxide to give a bicyclic compound 15. In 
5 some embodiments, sodium benzoate is used for displacement of 5 '-mesylate of compound 15 to 
give compound 16. In some embodiments, compound 17 is formed by reaction of compound 16 
with sodium azide. In some embodiments, compound 18 is produced by saponification of the 5'- 
benzoate of compound 17. In certain embodiments, hydrogenation of compound 18 produces 
compound 19. In certain embodiments, the peracelation method is used to benzolylate the 2- 
10 and 6-amino groups of compound 19, yielding 20, which is desirably converted into the 
phosphoramidite compound 21. 

In a related aspect, the invention features a derivative of a compound of the formula 20 or 
21 as described in the above aspect in which 3' -OH or -OP(0(CH2)2CN)N(iPr)2 group is 
replaced by any other group is selected from the group consisting of phosphorothioate, 
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phosphorodithioate, phosphoramidate, phosphoroselenoate, phosphorodiselenoate, 
alkylphosphotriester, methyl phosphonate, halo (e,g., chloro, fluoro, iodo, or bromo), optionally 
substituted aryl, (e.g., phenyl or benzyl), alkyl (e.g, methyl or ethyl), alkoxy (eg., methoxy), acyl 
(e.g. acetyl or benzoyl), aroyl, aralkyl, hydroxy, hydroxyalkyl, alkoxy, aryloxy, aralkoxy, nitro, 

5 cyano, carboxy, alkoxycarbonyl, aryloxycarbonyl, aralkoxycarbonyl, acylamino, aroylamine, 
alkylsulfonyl, arylsulfonyl, heteroarylsulfonyl, alkylsulfinyl, arylsulfinyl, heteroarylsulfinyl, 
alkylthio, arylthio, heteroarylthio, aralkylthio, heteroaralkylthiOjamidino, amino, carbamoyl, 
sulfamoyl, alkene, alkyne, protecting groups (e.g., silyl, 4,4*-dimethoxytrityl, 
monomethoxytrityl, or trityl(triphenylmethyl)), linkers (e.g., a linker containing an amine, 

10 ethylene glycol, quinone such as anthraquinone), detectable labels (e.g., radiolabels or 
fluorescent labels), and biotin. 



NH2 


NHPac 



R 
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In yet another aspect, the invention features a method of synthesizing a nucleic acid. 
This method involves synthesizing a nucleoside or nucleotide of formula 20 or 21 as shown 
below. The nucleoside, nucleoside phosphoramidite, or nucleotide is incorporated into a nucleic 
acid of the invention. This method can also be performed using any other appropriate protecting 
5 groups instead of DMT. 

hi some embodiments, compound 17 is formed by reaction of compound 7 with 1,3- 
dichloro-1 ,1 ,3,3-tetraisopropyldisiloxane. Desirably, compound 18 is formed by reaction of 
compound 17 with phenoxyacetic anhydride. In some embodiments, compound 19 is generated 
by reaction of compound 18 with acid. Desirably, compound 20 is produced by reacting 
10 compound 19 with DMT-Cl. In desirably embodiments, compound 20 is reacted with 2- 
cyanoethyl tetraisopropylphosphorodiamidite to give the phosphoramidite 21. 

In desirable embodiments, the R is H or P(0(CH2)2CN)N(iPr)2. In other embodiments, 

the R or -OR is any of the groups listed for R^ or R^ in formula la or formula lb or listed for R^ or 

'iff 

R in formula Ila, Scheme A, or Scheme B, or the group -OR or R is selected from the group 

1 5 consisting of-OH, P(0(CH2)2CN)N(iPr)2, phosphate, phosphorothioate, phosphorodithioate, 
phosphoramidate, phosphoroselenoate, phosphorodiselenoate, alkylphosphotriester, methyl 
phosphonate, halo (e.g., chloro, fluoro, iodo, or bromo), optionally substituted aryl, (eg., phenyl 
or benzyl), alkyl (eg, methyl or ethyl), alkoxy (eg., methoxy), acyl (eg. acetyl or benzoyl), 
aroyl, aralkyl, hydroxy, hydroxyalkyl, alkoxy, aryloxy, aralkoxy, nitro, cyano, carboxy, 

20 alkoxycarbonyl, aryloxycarbonyl, aralkoxycarbonyl, acylamino, aroylamine, alkylsulfonyl, 

arylsulfonyl, heteroarylsulfonyl, alkylsulfinyl, arylsulfinyl, heteroarylsulfmyl, alkylthio, arylthio, 
heteroarylthio, aralkylthio, heteroaralkylthio,amidino, amino, carbamoyl, sulfamoyl, alkene, 
alkyne, protecting groups (e.g., silyl, 4,4 -dimethoxytrityl, monomethoxytrityl, or 
trityl(triphenylmethyl)), linkers (e.g., a linker containing an amine, ethylene glycol, quinone such 

25 as anthraquinone), detectable labels (e.g., radiolabels or fluorescent labels), and biotin. 

In yet another aspect, the invention features a method of synthesizing a nucleic acid. 
This method involves synthesizing a nucleoside or nucleotide of formula 24 or 25 as shown 
below. The nucleoside, nucleoside phosphoramidite, or nucleotide is incorporated into a nucleic 
acid of the invention. This method can also be performed using any other appropriate protecting 

30 groups instead of Bz, Bn, and DMT. Additionally, the method can be performed with any other 
halogen (e.g., fluoro or bromo) instead of chloro. 
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CI 



,P-0(CH2)2CN 

(Pr')2N 


In some embodiments, the compound 16 is formed from compounds 4, 14, and 15 as 
illustrated in an aspect above. Desirably, the 5'-0-benzoyl group of compound 16 is hydrolyzed 
5 by aqueous sodium hydroxyde to give compound 22. Compound 23 is desirably produced by 
incubation of compound 22 in the presence of paladium hydroxide and ammonium formate. . 
Desirably, the 2-amine of compound 23 is selectively protected with an amidine group after 
treatment with iV,iV-dimethylformamide dimethyl acetal to yield compound 24. hi some 
embodiments, the diol 24 is 5'-0-DMT protected and 3'-(9-phosphitylated produce the 

10 phosphoramidite LNA-2AP compound 25. 

In some embodiments, compound 25 has one of the following groups instead of the 
P(0(CH2)2CN)N(iPr)2 group: any of the groups listed for or in formula la or formula lb or 
listed for R^ or R^* in formula Ila, Scheme A, or Scheme B, or a group selected from the group 
consisting of-OH, phosphate, phosphorothioate, phosphorodithioate, phosphoramidate, 

15 phosphoroselenoate, phosphorodiselenoate, alkylphosphotriester, methyl phosphonate, halo (e.g., 
chloro, fluoro, iodo, or bromo), optionally substituted aryl, (e.g., phenyl or benzyl), alkyl (e.g, 
methyl or ethyl), alkoxy (e.g., methoxy), acyl (eg acetyl or benzoyl), aroyl, aralkyl, hydroxy, 
hydroxyalkyl, alkoxy, aryloxy, aralkoxy, nitro, cyano, carboxy, alkoxycarbonyl, 
aryloxycarbonyl, aralkoxycarbonyl, acylamino, aroylamine, alkylsulfonyl, arylsulfonyl, 

20 heteroarylsulfonyl, alkylsulfinyl, arylsulfinyl, heteroarylsulfinyl, alkylthio, arylthio, 

heteroarylthio, aralkylthio, heteroaralkylthio,amidino, amino, carbamoyl, sulfamoyl, alkene, 
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alkyne, protecting groups (e.g., silyl, 4,4'-dimethoxytrityl, monomethoxytrityl, or 
trityl(triphenylmethyl)), linkers (e.g., a linker containing an amine, ethylene glycol, quinone such 
as anthraquinone), detectable labels (e.g., radiolabels or fluorescent labels), and biotin. 

In another aspect, the invention features a nucleic acid of the invention that includes a 
5 compound of the formula 6pCor the product of a compound of the formula 6pC treated with 
ammonia as described herein. In a related aspect, the invention features a method of 
synthesizing a nucleic acid that involves performing one or more of the steps described herein 
for the synthesis of a compound of the formula 6pCor the product of a compound of the formula 
6pC treated with ammonia. 

10 In yet another aspect, the invention features a method of synthesizing a nucleic acid. This 

method involves one or more of any of the nucleosides or nucleotides of the invention with (i) 
any other nucleoside or nucleotide of the invention, (ii) any other nucleoside or nucleotide of 
formula la, formula lb, formula Ila, Scheme A, or Scheme B, and/or (iii) any naturally-occurring 
nucleoside or nucleotide. Desirably, the method involves reacting one or more nucleoside 

15 phosphoramidites of any of the above aspects with a nucleotide or nucleic acid. 
Methods for Synthesis of Nucleic Acids on a Solid Support 

In another aspect, the invention provides a method for the synthesis of a population of 
nucleic acids (e.g., a population of nucleic acids of the invention) on a solid support. This 
method involves the reaction of a plurality of nucleoside phosphoramidites with an activated 

20 solid support (e.g., a solid support with an activated linker) and the subsequent reaction of a 

plurality of nucleoside phosphoramidites with activated nucleotides or nucleic acids bound to the 
solid support. 

In some embodiments of any of the above aspects, the solid support or the growing 
nucleic acid bound to the solid support is activated by illumination, a photogenerated acid, or 

25 electric current. In desirable embodiments, one or more spots or regions (e.g., a region with an 
area of less than 1 cm^, 0.1 cm^, 0.01 cm^, 1 mm^, or 0.1 mm^ that desirably contains one 
particular nucleic acid monomer or oligomer) on the solid support are irradiated to produce a 
photogenerated acid that removes the 5*-0H protecting group of one or more nucleic acid 
monomers or oligomers to which a nucleotide is subsequently added. In other embodiments, an 

30 electric current is applied to one or more spots or regions (e.g., a region with an area of less than 
1 cm^, 0.1 cm^, 0.01 cm^, 1 mm^ or 0.1 mm^ that desirably contains one particular nucleic acid 
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monomer or oligomer) on the solid support to remove an electrochemically sensitive protecting 
group of one or more nucleic acid monomers or oligomers to which a nucleotide is subsequently 
added. In still other embodiments, one or more spots or regions (e.g., a region with an area of 
less than 1 cm^, 0.1 cm^, 0.01 cm^, 1 mm^, or 0.1 mm^that desirably contains one particular 
5 nucleic acid monomer or oligomer) on the solid support are irradiated to remove a photosensitive 
protecting group of one or more nucleic acid monomers or oligomers to which a nucleotide is 
subsequently added. In various embodiments, the solid support (e.g., chip, coverslip, microscope 
glass slide, quartz, or silicon) is less than 1, 0.5, 0.1. or 0.05 mm thick. 
Methods for the Synthesis of Nucleic Acids 

10 In another aspect, the invention features a method of reacting a population of nucleic 

acids of the invention with one or more nucleic acids. This method involves incubating an 
immobilized population of nucleic acids of the invention with a solution that includes one or 
more probes (e.g., at least 2, 3, 4, 5, 10, 15, 20, 30, 40, 50, 60, 80, 100, or 150 different nucleic 
acids) and one or more target nucleic acids (e.g., at least 2, 3, 4, 5, 10, 15, 20, 30, 40, 50, 60, 80, 

15 100, or 150 different target nucleic acids). The incubation is performed in the presence of a 

ligase under conditions that allow the ligase to covalently react one or more immobilized nucleic 
acids with one or more nucleic acid probes in solution that hybridize to the same target nucleic 
acid. Desirably, at least 2, 5, 10, 15, 20, 30, 40, 50, 80, or 100 pairs of immobilized nucleic acids 
and nucleic acid probes are ligated. In various embodiments, the incubation occurs between 15 

20 and 45°C, such as between 20 and 40°C or between 25 and 35°C 
Desirable Embodiments of Any of the Aspects of the Invention 

In other embodiments of any of various aspects of the invention, a nucleic acid probe or 
primer specifically hybridizes to a target nucleic acid but does not substantially hybridize to non- 
target molecules, which include other nucleic acids in a cell or biological sample having a 

25 sequence that is less than 99, 95, 90, 80, or 70% identical or complementary to that of the target 
nucleic acid. Desirably, the amount of the these non-target molecules hybridized to, or 
associated with, the nucleic acid probe or primer, as measured using standard assays, is 2-fold, 
desirably 5-fold, more desirably 10-fold, and most desirably 50-fold lower than the amount of 
the target nucleic acid hybridized to, or associated with, the nucleic acid probe or primer. In 

30 other embodiments, the amount of a target nucleic acid hybridized to, or associated with, the 

nucleic acid probe or primer, as measured using standard assays, is 2-fold, desirably 5-fold, more 
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desirably 10-fold, and most desirably 50-fold greater than the amount of a control nucleic acid 
hybridized to, or associated with, the nucleic acid probe or primer. In certain embodiments, the 
nucleic acid probe or primer is substantially complementary (e.g., at least 80, 90, 95, 98, or 
100% complementary) to a target nucleic acid or a group of target nucleic acids from a cell. In 
5 other embodiments, the probe or primer is homologous to multiple RNA or DNA molecules, 
such as RNA or DNA molecules from the same gene family. In other embodiments, the probe or 
primer is homologous to a large number of RNA or DNA molecules. In desirable embodiments, 
the probe or primer binds to nucleic acids which have polynucleotide sequences that differ in 
sequence at a position that corresponds to the position of a universal base in the probe or primer. 

10 Examples of control nucleic acids include nucleic acids with a random sequence or nucleic acids 
known to have little, if any, affinity for the nucleic acid probe or primer. In some embodiments, 
the target nucleic acid is an RNA, DNA, or cDNA molecule. 

Desirably, the association constant (Ka) of the nucleic acid toward a complementary 
target molecule is higher than the association constant of the complementary strands of the 

15 double stranded target molecule. In some desirable embodiments, the melting temperature of a 
duplex between the nucleic acid and a complementary target molecule is higher than the melting 
temperature of the complementary strands of the double stranded target molecule. 

In some embodiments, the LNA-pyrene is in a position corresponding to the position of a 
non-base (e.g., a unit without a base) in another nucleic acid, such as a target nucleic acid. 

20 Incorporation of pyrene in a DNA strand that is hybridized against the four natural bases 

decreases the Tm by -4.5*^C to -6.8°C; however, incorporation of pyrene in a DNA strand in a 
position opposite a non-base only decreases the Tm by -2.3°C to -4.6°C, most likely due to the 
better accommodation of the pyrene in the B-type duplex (Matray and Kool, J. Am. Chem. Soc. 
120, 6191, 1998). Thus, incorporation on LNA-pyrene into a nucleic acid in a position opposite 

25 a non-base (e.g., a unit without a base or a unit with a small group such as a noncyclic group 
instead of a base) in a target nucleic acid may also minimize any potential decrease in Tm due to 
the pyrene substitution. 

In various embodiments, the number of molecules in the plurality of nucleic acids of the 
invention is at least 2, 4, 5, 6, 7, 8, or 10- fold greater than the number of molecules in the test 

30 nucleic acid sample. In some embodiments, a LNA is a triplex-forming oligonucleotide. 
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In desirable embodiments of any of the aspects of the invention, the target nucleic acids 
(e.g., cDNA molecules reverse transcribed from a patient sample or cRNA molecules amplified 
from a patient sample using a T7 RNA polymerase-based amplification system or the like) are 
fragmented using an enzyme such as a uracil-DNA glycosylase (e.g., E, coli uracil-DNA 
5 glycosylase) or using chemical hydrolysis such as alkaline hydrolysis. In various embodiments, 
the average size of the fragmented nucleic acids is between 300 and 50 nucleic acids, such as 
approximately 300, 200, 100, or 50 nucleotides. 
Advantages 

The present invention has a variety of advantages related to nucleic acid analysis 

10 methods. The ability to equalize melting temperatures of a series of nucleic acids is generally 
applicable and desirable in all situations where more than one sequence is used simultaneously 
(e.g. DNA arrays with more than one capture probe, PGR and especially multiplex PGR, 
homogeneous assays such as Taqman and Molecular beacon). Sample preparation of specific 
sequences (e.g., DNA or RNA extraction using capture probes on filters or magnetic beads) is 

1 5 another area where melting temperature equalization of specific probe sequences is usefiil. 

For example, the invention provides high affinity nucleotides (e.g., LNA and other high 
affinity nucleotides with a modified base and/or backbone) that can be used, e.g., arrays of the 
invention. In particular, the nucleic acids of the invention containing LNA units exhibited a 
suprising ability to discriminate between different mRNA splice variants compared to naturally- 

20 occurring nucleic acids. If desired, universal bases can be added as part of flanking regions in 
capture probes (e.g., probes of an array) to stabilize hybridization with high affinity nucleotides 
in the capture probes. Replacement of one or more DNA-t nucleotides with LNA-T and/or 
replacement of one or more DNA-a nucleotides with LNA-A reduces the variability of melting 
temperatures for capture probes of similar length but different GC and AT content by desirably 

25 at least 10, 20, 30, 40 or 50%. Additionally, replacement of one or more DNA-t nucleotides with 
LNA-T and/or replacement of one or more DNA-c with LNA-G increases the stability of a large 
number of capture probes, while desirably avoiding self-complementary sequences with 
LNAiLNA base-pairs within a capture probe that would otherwise reduce or eliminate the 
binding of target molecules to the probe. Although a general T and C substitution may not 

30 reduce the variability of melting temperatures of the probes, this substitution increases the 
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melting temperature and binding efficiency of many capture probes that contain these two 
nucleotides. 

The invention also provides a general substitution algorithm for enhancement of the 
hybridization signal of a test nucleic acid sample by inclusion of high affinity monomers (e.g., 
5 LNA and other high affinity nucleotides with a modified base and/or backbone) in the array. 
This method increases the stability and binding affinity of capture probes while avoiding 
substitutions in positions that may form self-complementary base-pairs which may otherwise 
inhibit binding to a target molecule. The substitution algorithm is broadly usefiil for specialized 
arrays, as well as for PCR primers and FISH probes. 
10 Other features and advantages of the invention will be apparent fi"om the following 

detailed description. 

Definitions 

When used herein, the term "LNA" (Locked Nucleoside Analogues) refers to nucleoside 

15 analogues (e.g., bicyclic nucleoside analogues, e.g., as disclosed in WO 9914226) either 

incorporated in an oligonucleotide or as a discrete chemical species (e.g., LNA nucleoside and 
LNA nucleotide). Furthermore, the term "LNA" includes the compounds, as described in the 
present specificatiion including the compounds described in Example 17. 

The term "monomeric LNA" may, e.g., refer to the monomers LNA A, LNA T, LNA C, 

20 or any other LNA monomers. 

By "LNA unit" is meant an individual LNA monomer (e.g., an LNA nucleoside or LNA 
nucleotide). or an oligomer (e.g., an oligonucleotide or nucleic acid) that includes at least one 
LNA monomer. LNA units as disclosed in WO 99/14226 are in general particularly desirable 
modified nucleic acids for incorporation into an oligonucleotide of the invention. Additionally, 

25 the nucleic acids may be modified at either the 3' and/or 5' end by any type of modification 
known in the art. For example, either or both ends may be capped with a protecting group, 
attached to a flexible linking group, attached to a reactive group to aid in attachment to the 
substrate surface, etc. Desirable LNA units and their method of synthesis also are disclosed in 
WO 0056746, WO 0056748, WO 0066604, Morita et a!., Bioorg. Med. Chem. Lett. 12(l):73-76, 

30 2002; Hakansson et ah, Bioorg. Med. Chem. Lett. 1 1(7):935-938, 2001; Koshkin et ai, J. Org. 
Chem. 66(25):8504-8512, 2001; Kvaemo et aL, J. Org. Chem. 66(16):5498-5503, 2001; 
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Hakansson et al, J. Org. Chem. 65(17):5161-5166, 2000; Kvaemo et al, J. Org. Chem. 
65(17):5167-5176, 2000; Pfiindhellere/ a/., Nucleosides Nucleotides 18(9):20 17-2030, 1999; 
and Kumar et aL, Bioorg. Med. Chem. Lett. 8(16):2219-2222, 1998. 

By "LNA modified oligonucleotide" is meant a oligonucleotide comprising at least one 
5 LNA monomeric unit of the general scheme A, described infra, having the below described 
illustrative examples of modifications: 


10 wherein X is selected from -0-, -S-, -N(R^)-, -C(R^R^*)-, -0-C(RV*)-, -C(R^R^*)-0-, -S- 
C(R^R^*)-, -C(R^R^*)-S-, -N(R^*)-C(R^R^*)-, -C(R^R^*)-N(R^*)-, and -C(R^R^*)-C(R^R^*). 


B is selected from a modified base as discussed above e.g. an optionally substituted 
carbocyclic aryl such as optionally substituted pyrene or optionally substituted 
pyrenylmethylglycerol, or an optionally substituted heteroalicylic or optionally substituted 

15 heteroaromatic such as optionally substituted pyridyloxazole, optionally substituted pyrrole, 
optionally substituted diazole or optionally substituted triazole moieties; hydrogen, hydroxy, 
optionally substituted CM-alkoxy, optionally substituted CM-alkyl, optionally substituted C^- 
acyloxy, nucleobases, DNA intercalators, photochemically active groups, thermochemically 
active groups, chelating groups, reporter groups, and ligands. 

20 P designates the radical position for an intemucleoside linkage to a succeeding monomer, 

or a 5 -terminal group, such intemucleoside linkage or 5'-terminal group optionally including the 
substituent R^. One of the substituents R^, R^*, R^, and R^* is a group P* which designates an 
intemucleoside linkage to a preceding monomer, or a 273 '-terminal group. The substituents of 
R^*, R^*, R\ R^ R^*, R^ R^*, R^, and the ones of R^ R^*, R\ and R^* not designating P* 

25 each designates a biradical comprising about 1-8 groups/atoms selected fi-om -C(R^R^)-, - 


C(R>C(R")-, -C(R>N-, -C(R>0-, -0-, -Si(R')2-, -C(R')-S, -S-, -SO2-, -C(R>N(R^)-, - 



A 
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N(R^)-, and >C=Q, wherein Q is selected from -0-, and -N(R^)-, and and each is 
independently selected from hydrogen, optionally substituted Ci.^-alkyl, optionally substituted 
C2.i2-alkenyl, optionally substituted C2-i2-alkynyl, hydroxy, Ci-^-alkoxy, C2-i2-alkenyloxy, 
carboxy, Ci.i2-alkoxycarbonyl, CM2-alkylcarbonyl, formyl, aryl, aryloxy-carbonyl, aryloxy, 
5 arylcarbonyl, heteroaryl, hetero-aryloxy-carbonyl, heteroaryloxy, heteroarylcarbonyl, amino, 
mono- and di(Ci^-alkyl)amino, carbamoyl, mono- and di(Ci.6-alkyl)-amino-carbonyl, amino-Ci. 
6-alkyl-aminocarbonyl, mono- and di(Ci-6-alkyl)amino-Ci.6-alkyl-aminocarbonyl, Ci-6-alkyl- 
carbonyl amino, carbamido, Ci-6-alkanoyloxy, sulphono, Ci.6-alkylsulphonyloxy, nitro, azido, 
sulphanyl, Ci.6-alkylthio, halogen, DNA intercalators, photochemically active groups, 

10 thermochemically active groups, chelating groups, reporter groups, and ligands, where aryl and 
heteroaryl may be optionally substituted, and where two geminal substituents R^ and R^ together 
may designate optionally substituted methylene (=CH2), and wherein two non-geminal or 
geminal substituents selected from R^, R^, and any of the substituents R^*, R^, R^*, R^, R^*, R"^*, 
R^, R^*, R*^ and R^*, R^, and R^* which are present and not involved in P, P* or the biradical(s) 

1 5 together may form an associated biradical selected from biradicals of the same kind as defined 
before; the pair(s) of non-geminal substituents thereby forming a mono- or bicyclic entity 
together with (i) the atoms to which said non-geminal substituents are bound and (ii) any 
. intervening atoms. 

Each of the substituents R**, R^ R^*, R^ R^*, R^ R^*, R^ and R^*, R\ and R^* which are 
20 present and not involved in P, P* or the biradical(s), is independently selected from hydrogen, 
optionally substituted CM2-alkyl, optionally substituted C2-i2-alkenyl, optionally substituted C2- 
12-alkynyl, hydroxy, d.^-alkoxy, C2-i2-alkenyloxy, carboxy, Ci.i2-alkoxycarbonyl, C1.12- 
alkylcarbonyl, formyl, aryl, aryloxy-carbonyl, aryloxy, arylcarbonyl, heteroaryl, heteroaryloxy- 
carbonyl, heteroaryl oxy, heteroarylcarbonyl, amino, mono- and di(Ci.6-alkyl)amino, carbamoyl, 
25 mono- and di(Ci.6-alkyl)-amino-carbonyl, amino-Ci.6-alkyl-aminocarbonyl, mono- and di(Ci.6- 
alkyl)amino-Ci.6-alkyl-aminocarbonyl, Ci.6-alkyl-carbonylamino, carbamido, Ci.6-alkanoyloxy, 
sulphono, Ci-a-alkylsulphonyloxy, nitro, azido, sulphanyl, Ci_6-alkylthio, halogen, DNA 
intercalators, photochemically active groups, thermochemically active groups, chelating groups, 
reporter groups, and ligands, where aryl and heteroaryl may be optionally substituted, and where 
30 two geminal substituents together may designate 0x0, thioxo, imino, or optionally substituted 
methylene, or together may form a spiro biradical consisting of a 1 -5 carbon atom(s) alkylene 
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chain which is optionally interrupted and/or terminated by one or more heteroatoms/groups 
selected from -0-, and -(NR^)- where is selected from hydrogen and CM-alkyl, and 
where two adjacent (non-geminal) substituents may designate an additional bond resulting in a 
double bond; and R^*, when present and not involved in a biradical, is selected from hydrogen 
5 and CM-alkyl; and basic salts and acid addition salts thereof. 

Exemplary 5*, 3', and/or 2* terminal groups include -H, -OH, halo (e.g., chloro, fluoro, 
iodo, or bromo), optionally substituted aryl, (e.g., phenyl or benzyl), alkyl (e.g, methyl or ethyl), 
alkoxy (e.g., methoxy), acyl (e.g. acetyl or benzoyl), aroyl, aralkyl, hydroxy, hydroxyalkyl, 
alkoxy, aryloxy, aralkoxy, nitro, cyano, carboxy, alkoxycarbonyl, aryloxycarbonyl, 

10 aralkoxycarbonyl, acylamino, aroylamine, alkylsulfonyl, arylsulfonyl, heteroarylsulfonyl, 
alkylsulfinyl, arylsulfmyl, heteroarylsulfinyl, alkylthio, arylthio, heteroarylthio, aralkylthio, 
heteroaralkylthio,amidino, amino, carbamoyl, sulfamoyl, alkene, alkyne, protecting groups (e.g., 
silyl, 4,4'-dimethoxytrityl, monomethoxytrityl, or trityl(triphenylmethyl)), linkers (e.g., a linker 
containing an amine, ethylene glycol, quinone such as anthraquinone), detectable labels (e.g., 

1 5 radiolabels or fluorescent labels), and biotin. 

It is understood that references herein to a nucleic acid unit, nucleic acid residue, LNA 
unit, or similar term are inclusive of both individual nucleoside units and nucleotide units and 
nucleoside units and nucleotide units within an oligonucleotide. 

A "modified base" or other similar term refers to a composition (e.g., a non-naturally 

20 occuring nucleobase or nucleosidic base) which can pair with a natural base (e.g., adenine, 
guanine, cytosine, uracil, and/or thymine) and/or can pair with a non-naturally occurring 
nucleobase or nucleosidic base. Desirably, the modified base provides a Tm differential of 15, 
12, 10, 8, 6, 4, or 2®C or less as described herein. Exemplary modified bases are described in EP 
1 072 679 and WO 97/12896. 

25 By ''nucleobase" is meant the naturally occurring nucleobases adenine (A), guanine (G), 

cytosine (C), thymine (T) and uracil (U) as well as non-naturally occurring nucleobases such as 
xanthine, diaminopurine, 8-oxo-N^-methyladenine, 7-deazaxanthine, 7-deazaguanine, N'*,N'*- 
ethanocytosin, N^,N^-ethano-2,6-diaminopurine, 5-methylcytosine (mC), 5-(C^-C^)-alkynyl- 
cytosine, 5-fluorouracil, 5-bromouracil, pseudoisocytosine, 2-hydroxy-5-methyl-4-. 

30 triazolopyridin, isocytosine, isoguanine, inosine and the "non-naturally occurring" nucleobases 
described in Benner et al., U.S. Pat No. 5,432,272 and Susan M. Freier and Karl-Heinz Altmann, 
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Nucleic Acids Research, 1997, vol. 25, pp 4429-4443. The term "nucleobase" thus includes not 
only the known purine and pyrimidine heterocycles, but also heterocyclic analogues and 
tautomers thereof. Further naturally and non-naturally occurring nucleobases include those 
disclosed in U.S. Pat. No. 3,687,808 (Merigan, et al.), in Chapter 15 by Sanghvi, in Antisense 
5 Research and Application, Ed, S. T. Crooke and B. Lebleu, CRC Press, 1993, in Englisch et al., 
Angewandte Chemie, International Edition, 1991, 30, 613-722 (see especially pages 622 and 
623, and in the Concise Encyclopedia of Polymer Science and Engineering, J. L Kroschwitz Ed., 
John Wiley & Sons, 1990, pages 858-859, Cook, Anti-Cancer Drug Design 1991, 6, 585-607, 
each of which are hereby incorporated by reference in their entirety). The term "nucleosidic 

10 base" or "base unit" is further intended to include compounds such as heterocyclic compounds 
that can serve like nucleobases including certain "universal bases" that are not nucleosidic b.ases 
in the most classical sense but serve as nucleosidic bases. Especially mentioned as universal 
bases are 3-nitropyrrole, optionally substituted indoles (e.g., 5-nitroindole), and optionally 
substituted hypoxanthine. Other desirable universal bases include, pyrrole, diazole or triazole 

1 5 derivatives, including those universal bases known in the art. 

As described herein, various groups of an LNA unit may be optionally substituted. A 
"substituted" group such as a nucleobase or nucleosidic base and the like may be substituted by 
other than hydrogen at one or more available positions, typically 1 to 3 or 4 positions, by one or 
more suitable groups such as those disclosed herein. Suitable groups that may be present on a 

20 "substituted" group include e.g. halogen such as fluoro, chloro, bromo and iodo; cyano; 
hydroxyl; nitro; azido; alkanoyl such as a Ci.6 alkanoyl group such as acyl and the like; 
carboxamido; alkyl groups including those groups having 1 to about 12 carbon atoms, or 1, 2, 3, 
4, 5, or 6 carbon atoms; alkenyl and alkynyl groups including groups having one or more 
unsaturated linkages and from 2 to 12 carbon, or 2, 3, 4, 5 or 6 carbon atoms; alkoxy groups 

25 including those having one or more oxygen linkages and from 1 to about 1 2 carbon atoms, or 1 , 
2, 3, 4, 5 or 6 carbon atoms; aryloxy such as phenoxy; alkylthio groups including those moieties 
having one or more thioether linkages and from 1 to about 12 carbon atoms, or 1, 2, 3, 4, 5 or 6 
carbon atoms; alkylsulfmyl groups including those moieties having one or more sulfinyl linkages 
and from 1 to about 12 carbon atoms, or 1, 2, 3, 4, 5, or 6 carbon atoms; alkylsulfonyl groups 

30 including those moieties having one or more sulfonyl linkages and from 1 to about 12 carbon 
atoms, or 1, 2, 3, 4, 5, or 6 carbon atoms; aminoalkyl groups such as groups having one or more 
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N atoms and from 1 to about 12 carbon atoms, or 1 , 2, 3, 4, 5 or 6 carbon atoms; carbocyclic aryl 
having 6 or more carbons; aralkyl having 1 to 3 separate or fijsed rings and from 6 to about 1 8 
carbon ring atoms, with benzyl being a desirable group; aralkoxy having 1 to 3 separate or fused 
rings and from 6 to about 18 carbon ring atoms, with O-benzyl being a desirable group; or a 
5 heteroaromatic or heteroalicyclic group having 1 to 3 separate or fiised rings with 3 to about 8 
members per ring and one or more N, O or S atoms, e.g. coumarinyl, quinolinyl, pyridyl, 
pyrazinyl, pyrimidyl, furyl, pyrrolyl, thienyl, thiazolyl, oxazolyl, imidazolyl, indolyl, 
benzofiiranyl, benzothiazolyl, tetrahydrofuranyl, tetrahydropyranyl, piperidinyl, morpholino and 
pyrrolidinyl. 

10 By "oxy-LNA monomer or unit" is meant any nucleoside or nucleotide which contains an 

oxygen atom in a linkage. 

A "non-oxy-LNA" monomer or unit is broadly defined as any nucleoside or nucleotide 

which does not contain an oxygen atom in a 2-4'- linkage. Examples of non-oxy-LNA 

monomers include 2'-deoxynucleotides (DNA) or nucleotides (RNA) or any analogues of these 
15 monomers which are not oxy-LNA, such as for example the thio-LNA and amino-LNA 

described herein with respect to formula la and in Singh et al J. Org. Chem. 1998, 6, 6078-9, 

and the derivatives described in Susan M. Freier and Karl-Heinz Altmann, Nucleic Acids 

Research, 1997, vol 25, pp 4429-4443. 

By "universal base" is meant a naturally-occurring or desirably a non-naturally occurring 
20 compound or moiety that can pair with a natural base (e.g., adenine, guanine, cytosine, uracil, 

and/or thymine), and that has a Tm differential of 15, 12, 10, 8, 6, 4, or 2^C or less as described 

herein. 

By "oligonucleotide," "oligomer," or "oligo" is meant a successive chain of monomers 
(e.g., glycosides of heterocyclic bases) connected via intemucleoside linkages. The linkage 

25 between two successive monomers in the oligo consist of 2 to 4, desirably 3, groups/atoms 

selected from -CH2-, -0-, -S-, -NR"-, >C=0, >C=NR", >C=S, -Si(R")2-, -SO-, -S(0)2-, -P(0)2-, 
-P0(BH3)-, -P(0,S)-, -P(S)2-, -P0(R")-, -P0(0CH3)-, and -P0(NHR")-, where R" is selected 
from hydrogen and CM-alkyl, and R" is selected from Ci^-alkyl and phenyl. Illustrative 
examples of such linkages are -CH2-CH2-CH2-, -CH2-CO-CH2-, -CH2-CHOH-CH2-, -O-CH2-O-, 

30 -O-CH2-CH2-, -0-CH2-CH= (including R^ when used as a linkage to a succeeding monomer), 
-CH2-CH2-O-, -NR"-CH2-CH2-, -CH2-CH2-NR"-, -CH2-NR"-CH2-, -O-CH2-CH2-NR"-, 
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-nr"-co-o-, -nr"-co-nr"-, -nr"-cs-nr"-, .NR"-C(=NR")-NR"-, -NR"-C0-CH2-NR"-, - 
0-C0-0-, -O-CO-CH2-O., -O-CH2-CO-O-, -CH2-CO-NR"-, -0-CO-NR"-, -NR"-C0-CH2-, - 
O-CH2-CO-NR"-, -O-CH2-CH2-NR"-, -CH=N-0-, -CH2-NR"-0-, -CH2-0-N= (including R^ 
when used as a linkage to a succeeding monomer), -CH2-O-NR"-, -C0-NR"-CH2-, -CH2-NR"- 

5 -CH2-NR"-C0-, -0-NR"-CH2-, -0-NR"«, -O-CH2-S-, -S-CH2-O-, -CH2-CH2-S-, -O-CH2- 

CH2-S-, -S-CH2-CH= (including R^ when used as a linkage to a succeeding monomer), -S-CH2- 
CH2-, -S-CH2-CH2-O-, -S-CH2-CH2-S-, -CH2-S-CH2-, -CH2-SO-CH2-, -CH2-SO2-CH2-, 
-0-S0-0-, -0-S(0)2-0-, -0-S(0)2-CH2-, -0-S(0)2-NR"-, -NR"-S(0)2-CH2-, -0-S(0)2-CH2-, - 
0-P(0)2-0-, -0-P(0,S)-0-, -0-P(S)2-0-, -S-P(0)2-0-, -S-P(0,S)-0-, .S-P(S)2-0., -0-P(0)2-S-, 

10 -0-P(0,S)-S-, -0-P(S)2-S-, .S-P(0)2-S-, -S-P(0,S)-S-, -S-P(S)2-S-, .0-PO(R^O-0-, -o- 
P0(0CH3)-0-, -0-PO(OCH2CH3)-0-, -0-PO(OCH2CH2S-R)-0-, -0-P0(BH3)-0-, -O- 
PO(NHR^)-0-, -0>P(0)2-NR"., .NR"-P(0)2-0-, -0-P(0,NR")-0-, -CH2-P(0)2-0-, -0-P(0)2- 
CH2-, and -0-Si(R")2-0-; among which -CH2-CO-NR"-, -CH2-NR"-0-, -S-CH2-O-, -0-P(0)2-0- 
, .0-P(0,S)-0-, -0-P(S)2-0-, -NR".P(0)2-0-, -0-P(0,NR").0-, -0-PO(R")-0-, -0"P0(CH3)-0-, 

15 and -0-PO(NHR^)-0-, where R" is selected form hydrogen and CM-alkyl, and R" is selected 
from Ci-6-alkyl and phenyl, are especially desirable. Further illustrative examples are given in 
Mesmaeker et. al. Current Opinion in Structural Biology 1995, 5, 343-355 and Susan M. Freier 
and Karl-Heinz Altmann, Nucleic Acids Research, 1997, vol 25, pp 4429-4443. The left-hand 
side of the intemucleoside linkage is bound to the 5-membered ring as substituent P* at the 3'- 

20 position, whereas the right-hand side is bound to the 5 '-position of a preceding monomer. 

By "succeeding monomer" is meant the neighboring monomer in the 5'-terminal 
direction, and by "preceding monomer" is meant the neighboring monomer in the 3'-terminal 
direction. 

By "LNA spiked oligo" is meant an oligonucleotide, such as a DNA oligonucleotide, 
25 wherein at least one unit (and preferably not all units) has been substituted by the corresponding 
LNA nucleoside monomer. 

The term "Tm" is used in reference to the "melting temperature." The melting 
temperature is the temperature at which 50% of a population of double-stranded nucleic acid 
molecules becomes dissociated into single strands. The equation for calculating the Tm of nucleic 
30 acids is well-known in the art. The Tm of a hybrid nucleic acid is often estimated using a formula 
adopted from hybridization assays in 1 M salt, and commonly used for calculating Tm for PGR 
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primers: Tm =[(nuinber of A+T) x 2°C + (number of G+C) x 4*'C]. C. R, Newton et al. PGR, 2nd 
Ed., Springer- Verlag (New York: 1997), p. 24. This formula was found to be inaccurate for 
primers longer that 20 nucleotides. Id. Other more sophisticated computations exist in the art 
which take structural as well as sequence characteristics into account for the calculation of Tm. A 
5 calculated Tm is merely an estimate; the optimum temperature is commonly determined 
empirically. 

A nucleic acid compound that has a Tm differential of a specified amount (e.g., less than 
15, 12, 10, 8, 6, 4, 2, or 1°G) means the nucleic acid exhibits that specified Tm differential when 
incorporated into a specified 9-mer oligonucleotide with respect to the four complementary 
10 variants, as defined immediately below. 

Unless otherwise indicated, a Tm differential provided by a particular modified base is 
calculated by the following protocol (steps a) through d)): 

a) incorporating the modified base of interest into the following oligonucleotide 5*- 
d(GTGAMATGG), wherein M is the modified base; 
1 5 b) mixing 1.5x1 0'^M of the oligonucleotide having incorporated therein the 

modified base with each of 1 .5xlO"^M of the four oligonucleotides having the sequence 3 - 
d(CAGTYTAGG), wherein Y is A, C, G, T, respectively, in a buffer of lOmM sodium 
phosphate, 100 mM sodium chloride, 0.1 mM EDTA, pH 7.0; 
c) allowing the oligonucleotides to hybridize; and 
20 d) detecting the Tm for each of the four hybridized nucleotides by heating the 

hybridized nucleotides and observing the temperature at which the maximum of the first 
derivative of the melting curve recorded at a wavelength of 260 nm is.obtained. 

Unless otherwise indicated, a Tm differential for a particular modified base is determined 
by subtracting the highest Tm value determined in steps a) through d) immediately above fi'om 
25 the lowest Tm value determined by steps a) through d) immediately above. 

By "variance in Tm is meant the variance in the values of the melting temperatures for a 
population of nucleic acids. The Tm for each nucleic acid is determined by experimentally 
measuring or computationally predicting the temperature at which 50% of a population double- 
stranded molecules with the sequence of the nucleic acid becomes dissociated into single strands. 
30 For a nucleic acid with only A, T, C, G, and/or U bases, the Tm is the temperature at which 50% 
of a population of 100% complementary double-stranded molecules with the sequence of the 
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nucleic acid becomes dissociated into single strands. For determining the Tm variance when a 
nucleic acid has one or more nucleobases other than A, T, C, G, or U, the Tm of this "modified" 
nucleic acid is approximated by determining the Tm for each possible double stranded molecule 
in which one strand is the modified nucleic acid and the other strand has either A, T, C, or G in 
5 each position corresponding to a nucleobase other than A, T, C, G, or U in the modified nucleic 
acid. For example, if the modified nucleic acid has the sequence XMX in which X is 0, 1, or 
more A, T, C, G, or U bases and M is any other nucleobase or nucleosidic base, the Tm is 
calculated for each possible double stranded molecule in which one strand is XMX and the other 
strand is X'YX' in which X' is the base complementary to the corresponding X base and Y is 

10 either A, T, C, or G. The average is then calculated for the Tm values for each possible double 
stranded molecule (i.e., four possible duplexes per modified nucleobase or nucleoside base in the 
modified nucleic acid) and used as the approximate Tm value for the modified nucleic acid. 

By "capture efficiency" is meant the amount of target nucleic acid(s) bound to a 
particular nucleic acid or a population of nucleic acids. Standard methods can be used to 

15 calculate the capture efficiency by measuring the amount of bound target nucleic acid(s) and/or 
measuring the amount of unbound target nucleic acid(s). The capture efficiency of a nucleic acid 
or nucleic acid population of the invention is typically compared to the capture efficiency of a 
control nucleic acid or nucleic acid population under the same incubation conditions (e.g., using 
same buffer and temperature). 

20 For example, a control nucleic acid may have P-D-2-deoxyribose instead of one or more 

bicyclic or sugar groups of a LNA unit or other modified or non-naturally-occurring units in a 
nucleic acid of the invention. In some embodiments, the nucleic acid of the invention and the 
control nucleic acid only have naturally-occurring nucleobases. If a nucleic acid of the invention 
has one or more non-naturally-occurring nucleobases, the capture efficiency of the corresponding 

25 control nucleic acid is calculated as the average capture efficiency for all of the nucleic acids that 
have either A, T, C, or G in each position corresponding to a non-naturally-occurring nucleobase 
in the nucleic acid of the invention. 

Monomers are referred to as being "complementary" if they contain nucleobases that can 
form hydrogen bonds according to Watson-Crick base-pairing rules (e.g., G with C, A with T, or 

30 A with U) or other hydrogen bonding motifs such as for example diaminopurine with T, inosine 
with C, and pseudoisocytosine with G. 
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By "substantially complementarity" is meant having a sequence that is at least 60, 70, 80, 
90, 95, or 100% complementary to that of another sequence. Sequence complementarity is 
typically measured using sequence analysis software with the default parameters specified 
therein (e.g., Sequence Analysis Software Package of the Genetics Computer Group, University 
5 of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 53705). This 
software program matches similar sequences by assigning degrees of homology to various 
substitutions, deletions, and other modifications. 

The term "homology" refers to a degree of complementarity. There can be partial 
homology or complete homology (i.e., identity). A partially complementary sequence that at 
10 least partially inhibits a completely complementary sequence from hybridizing to a target nucleic 
acid is referred to using the functional term "substantially homologous." 

When used in reference to a double-stranded nucleic acid sequence such as a cDNA or 
genomic clone, the term "substantially homologous" refers to a probe that can hybridize to a 
strand of the double-stranded nucleic acid sequence under conditions of low stringency, e.g. 
15 using a hybridization buffer comprising 20% formamide in 0.8M saline/O.OSM sodium citrate 
(SSC) buffer at a temperature of 37°C and remaining bound when subject to washing once with 
thatSSC buffer at 37^, 

When used in reference to a single-stranded nucleic acid sequence, the term 
"substantially homologous" refers to a probe that can hybridize to (i.e., is the complement of) the 
20 single-stranded nucleic acid template sequence under conditions of low stringency, e.g. using a 
hybridization buffer comprising 20% formamide in 0.8M saline/0.08M sodium citrate (SSC) 
buffer at a temperature of 37°C and remaining bound when subject to washing once with that 
SSC buffer at 37^C. 

By "internal probe" is meant a nucleic acid (e.g., a probe or primer) that hybridizes to 
25 either only one exon or only one intron of a nucleic acid (e.g., mRNA). The internal probe may 
hybridize to the 5' end of the exon or intron, the 3' end of the exon or intron, or between the 5' 
end and the 3' end of the exon or intron. Desirably, the internal probe is at least 90, 95, 96, 97, 
98, 99, or 100% identical to the corresponding region of a target nucleic acid. 

By "merged probe" is meant a nucleic acid (e.g., a probe or primer) that hybridizes to 
30 more than one exon and/or intron of a nucleic acid (e.g., mRNA). Desirably, the merged probe 
hybridizes to two consecutive exons (e.g., exons in a mature mRNA transcript that may or may 
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not be consecutive in the corresponding DNA molecule). In another desirable embodiment, the 
merged probe hybridizes to an exon and the consecutive intron. In desirable embodiments, the 
merged probe hybridizes to the same number of nucleotides in each exon or to the same number 
of nucleotides in the exon and intron. In various embodiments, the length of the region of the 
5 merged probe that hybridizes to one exon differs by less than 60, 40, 20, 10, or 5% from the 
length of the region of the merged probe that hybridizes to the other exon or to the intron. 
Desirably, the merged probe is at least 90, 95, 96, 97, 98, 99, or 100% identical to the 
corresponding region of a target nucleic acid. 

By "poly-T2o tail " is meant a DNA polymer consisting of 20 DNA-t units added by 
10 polymerase chain reaction as a tail to a nucleic acid sequence, which is subsequently cloned in a 
plasmid vector allowing in vitro synthesis of poly(A)2o polyadenylated RNA, 

By "mixmer" or "mixmer probe** is meant a nucleic acid (e.g., a probe or primer) that 
contains at least one LNA unit and at least one RNA or DNA unit (e.g., a naturally-occurring 
RNA or DNA unit). 

15 By "corresponding unmodified reference nucleobase" is meant a nucleobase that is not 

part of an LNA unit and is in the same orientation as the nucleobase in an LNA unit. 

By "mutation" is meant an alteration in a naturally-occurring or reference nucleic acid 
sequence, such as an insertion, deletion, frameshift mutation, silent mutation, nonsense mutation, 
or missense mutation. Desirably, the amino acid sequence encoded by the nucleic acid sequence 

20 has at least one amino acid alteration from a naturally-occurring sequence. 

By "selecting" is meant substantially partitioning a molecule from other molecules in a 
population. Desirably, the partitioning provides at least a 2-fold, desirably, a 30-fold, more 
desirably, a 100-fold, and most desirably, a 1,000-fold enrichment of a desired molecule relative 
to undesired molecules in a population following the selection step. The selection step may be 

25 repeated a number of times, and different types of selection steps may be combined in a given 
approach. The population desirably contains at least 10^ molecules, more desirably at least 10^\ 
10^^, or 10^"^ molecules and, most desirably, at least 10^^ molecules. 

By a "population" is meant more than one nucleic acid. A "population" according to the 
invention desirably means more than lO', 10^, 10^, or 10"* different molecules. 

30 By "photochemically active groups" is meant compounds which are able to undergo 

chemical reactions upon irradiation with light. Illustrative examples of fiinctional groups are 
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quinones, especially 6-methyM ,4-naphtoquinone, anthraquinone, naphtoquinone, and 1,4- 
dimethyl-anthraquinone, diazirines, aromatic azides, benzophenones, psoralens, diazo 
compounds, and diazirino compounds. 

By "thermochemically reactive group" is meant a functional group which is able to 
5 undergo thermochemically-induced covalent bond formation with other groups. Illustrative 
examples of functional parts of thermochemically reactive groups are carboxylic acids, 
carboxylic acid esters such as activated esters, carboxylic acid halides such as acid fluorides, acid 
chlorides, acid bromide, acid iodides, carboxylic acid azides, carboxylic acid hydrazides, 
sulfonic acids, sulfonic acid esters, sulfonic acid halides, semicarbazides, thiosemicarbazides, 

10 aldehydes, ketones, primary alcohols, secondary alcohols, tertiary alcohols, phenols, alkyl 
halides, thiols, disulphides, primary amines, secondary amines, tertiary amines, hydrazines, 
epoxides, maleimides, and boronic acid derivatives. 

By "chelating group" is meant a molecule that contains more than one binding site and 
frequently binds to another molecule, atom, or ion through more than one binding site at the 

15 same time. Examples of functional parts of chelating groups are iminodiacetic acid, 

nitrilotriacetic acid, ethyl enediamine tetraacetic acid (EDTA), and aminophosphonic acid. 

By "reporter group" is meant a group which is detectable either by itself or as a part of an 
detection series. Examples of functional parts of reporter groups are biotin, digoxigenin, 
fluorescent groups (e.g., groups which are able to absorb electromagnetic radiation, e.g. light or 

20 X-rays, of a certain wavelength, and which subsequently reemit the energy absorbed as radiation 
of longer wavelength; such as dansyl (5-dimethylamino)-l-naphthalenesulfonyl), DOXYL (N- 
oxyl-4,4-dimethyloxazolidine), PROXYL (N-oxyl-2,2,5,5-tetramethylpyrrolidine), TEMPO (N- 
oxyl-2,2,6,6-tetramethylpiperidine), dinitrophenyl, acridines, coumarins, Cy3 and Cy5 
(trademarks for Biological Detection Systems, Inc.), erythrosine, coumaric acid, umbelliferone, 

25 Texas red, rhodamine, tetramethyl rhodamine, Rox, 7-nitrobenzo-2-oxa-l-diazole (NBD), 

pyrene, fluorescein, Europium, Ruthenium, Samarium, and other rare earth metals), radioisotopic 
labels, chemiluminescence labels (i.e., labels that are detectable via the emission of light during a 
chemical reaction), spin labels (a free radical e.g., substituted organic nitroxides) or other 
paramagnetic probes (e.g., Cu^'*^or Mg^"*") bound to a biological molecule being detectable by the 

30 use of electron spin resonance spectroscopy), enzymes (such as peroxidases, alkaline 

phosphatases, (3-galactosidases, and glycose oxidases), antigens, antibodies, haptens (e.g., groups 
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which are able to combine with an antibody, but which cannot initiate an immune response by 
itself, such as peptides and steroid hormones), carrier systems for cell membrane penetration, 
fatty acid units, steroid moieties (cholesteryl), vitamin A, vitamin D, vitamin E, folic acid 
peptides for specific receptors, groups for mediating endocytose, epidermal growth factor (EGF), 
5 bradykinin, and platelet derived growth factor (PDGF). Especially desirable groups are biotin, 
fluorescein, Texas Red, rhodamine, dinitrophenyl, digoxigenin. Ruthenium, Europium, Cy5, and 
Cy3. 

By "ligand" is meant a compound which binds. Ligands can comprise functional groups 
such as aromatic groups (such as benzene, pyridine, naphthalene, anthracene, and phenanthrene), 

10 heteroaromatic groups (such as thiophene, furan, tetrahydroftiran, pyridine, dioxane, and 

pyrimidine), carboxylic acids, carboxylic acid esters, carboxylic acid halides, carboxylic acid 
azides, carboxylic acid hydrazides, sulfonic acids, sulfonic acid esters, sulfonic acid halides, 
semicarbazides, thiosemicarbazides, aldehydes, ketones, primary alcohols, secondary alcohols, 
tertiary alcohols, phenols, alkyl halides, thiols, disulphides, primary amines, secondary amines, 

15 tertiary amines, hydrazines, epoxides, maleimides, CrCao alkyl groups optionally interrupted or 
terminated with one or more heteroatoms such as oxygen atoms, nitrogen atoms, and/or sulphur 
atoms, optionally containing aromatic or mono/polyunsaturated hydrocarbons, polyoxyethylene 
such as polyethylene glycol, oligo/polyamides such as poly-a-alanine, polyglycine, polylysine, 
peptides, oligo/polysaccharides, oligo/polyphosphates, toxins, antibiotics, cell poisons, and 

20 steroids, "Affinity ligands" include functional groups or biomolecules that have a specific 
affinity for sites on particular proteins, antibodies, poly- and oligosaccharides, and other 
biomolecules. 

It should be understood that the above-mentioned specific examples under DNA 
intercalators, photochemically active groups, thermochemically active groups, chelating groups, 

25 reporter groups, and ligands correspond to the "active/functional" part of the groups in question. 
For the person skilled in the art it is furthermore clear that DNA intercalators, photochemically 
active groups, thermochemically active groups, chelating groups, reporter groups, and ligands 
are typically represented in the form M-K- where M is the "active/functional" part of the group 
in question and where K is a spacer through which the "active/functional" part is attached to the 

30 5- or 6-membered ring. Thus, it should be understood that the group B, in the case where B is 
selected from DNA intercalators, photochemically active groups, thermochemically active 
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groups, chelating groups, reporter groups, and ligands, has the form M-K-, where M is the 
"active/functionar* part of the DNA intercalator, photochemically active group, 
thermochemically active group, chelating group, reporter group, and Hgand, respectively, and 
where K is an optional spacer comprising 1-50 atoms, desirably 1-30 atoms, in particular 1-15 
5 atoms, between the 5- or 6-membered ring and the "active/functional" part. 

By "spacer" is meant a thermochemically and photochemically non-active distance- 
making group and is used to join two or more different moieties of the types defined above. 
Spacers are selected on the basis of a variety of characteristics including their hydrophobicity, 
hydrophilicity, molecular flexibility and length (e.g., Hermanson et ai, "Immobilized Affinity 

10 Ligand Techniques," Academic Press, San Diego, California (1992). Generally, the length of the 
spacers is less than or about 400 A, in some applications desirably less than 100 A. The spacer, 
thus, comprises a chain of carbon atoms optionally interrupted or terminated with one or more 
heteroatoms, such as oxygen atoms, nitrogen atoms, and/or sulphur atoms. Thus, the spacer K 
may comprise one or more amide, ester, amino, ether, and/or thioether functionalities, and , 

1 5 optionally aromatic or mono/polyunsaturated hydrocarbons, polyoxyethylene such as 

polyethylene glycol, oligo/polyamides such as poly-a-alanine, polyglycine, polylysine, peptides, 
oligosaccharides, or oligo/polyphosphates. Moreover the spacer may consist of combined units 
thereof The length of the spacer may vary, taking into consideration the desired or necessary 
positioning and spatial orientation of the "active/functional" part of the group in question in 

20 relation to the 5- or 6-membered ring. In particularl embodiments, the spacer includes a 

chemically cleavable group. Examples of such chemically cleavable groups include disulphide 
groups cleavable under reductive conditions and peptide fragments cleavable by peptidases. 

By "target nucleic acid" or "nucleic acid target" is meant a particular nucleic acid 
sequence of interest. Thus, the "target" can exist in the presence of other nucleic acid molecules 

25 or within a larger nucleic acid molecule. 

By "solid support" is meant any rigid or semi-rigid material to which a nucleic acid binds 
or is directly or indirectly attached. The support can be any porous or non-porous water 
insoluble material, including without limitation, membranes, filters, chips, slides, wafers, fibers, 
magnetic or nonmagnetic beads, gels, tubing, strips, plates, rods, polymers, particles, 

30 microparticles, capillaries, and the like. The support can have a variety of surface forms, such as 
wells, trenches, pins, channels and pores. 
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By an "array" is meant a fixed pattern of at least two different immobilized nucleic acids 
on a solid support. Desirably, the array includes at least 10 , more desirably, at least 10 , and, 
most desirably, at least 1 0"* different nucleic acids. 

By "antisense nucleic acid" is meant a nucleic acid, regardless of length, that is 
5 complementary to a coding strand or mRNA of interest. In some embodiments, the antisense 
molecule inhibits the expression of only one nucleic acid, and in other embodiments, the 
antisense molecule inhibits the expression of more than one nucleic acid. Desirably, the 
antisense nucleic acid decreases the expression or biological activity of a nucleic and or encoded 
protein by at least 20, 40, 50, 60, 70, 80, 90, 95, or 100%. An antisense molecule can be 

10 introduced, e.g., to an individual cell or to whole animals, for example, it may be introduced 
systemically via the bloodstream. Desirably, a region of the antisense nucleic acid or the entire 
antisense nucleic acid is at least 70, 80, 90, 95, 98, or 100% complementary to a coding 
sequence, regulatory region (5* or 3' untranslated region), or an mRNA of interest. Desirably, the 
region of complementarity includes at least 5, 10, 20, 30, 50, 75,100, 200, 500, 1000, 2000 or 

15 5000 nucleotides or includes all of the nucleotides in the antisense nucleic acid. 

In some embodiments, the antisense molecule is less than 200, 150, 100, 75, 50, or 25 
nucleotides in length. In other embodiments, the antisense molecule is less than 50,000; 10,000; 
5,000; or 2,000 nucleotides in length. In certain embodiments, the antisense molecule is at least 
200, 300, 500, 1000, or 5000 nucleotides in length. In some embodiments, the number of 

20 nucleotides in the antisense molecule is contained in one of the following ranges: 5-15 

nucleotides, 16-20 nucleotides, 21-25 nucleotides, 26-35 nucleotides, 36-45 nucleotides, 46-60 
nucleotides, 61-80 nucleotides, 81-100 nucleotides, 101-150 nucleotides, or 151-200 nucleotides, 
inclusive. In addition, the antisense molecule may contain a sequence that is less than a full- 
length sequence or may contain a full-length sequence. 

25 By "double stranded nucleic acid" is meant a nucleic acid containing a region of two or 

more nucleotides that are in a double stranded conformation. In various embodiments, the 
double stranded nucleic acids consist entirely of LNA units or a mixture of LNA units, 
ribonucleotides, and/or deoxynucleotides. The double stranded nucleic acid may be a single 
molecule with a region of self-complementarity such that nucleotides in one segment of the 

30 molecule base-pair with nucleotides in another segment of the molecule. Alternatively, the 
double stranded nucleic acid may include two different strands that have a region of 
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complementarity to each other. Desirably, the regions of complementarity are at least 70, 80, 90, 
95, 98, or 100% identical. Desirably, the region of the double stranded nucleic acid that is 
present in a double stranded conformation includes at least 5, 10, 20, 30, 50, 75,100, 200, 500, 
1000, 2000 or 5000 nucleotides or includes all of the nucleotides in the double stranded nucleic 
5 acid. Desirable double stranded nucleic acid molecules have a strand or region that is at least 70, 
80, 90, 95, 98, or 100% identical to a coding region or a regulatory sequence (e.g., a transcription 
factor binding site, a promoter, or a 5* or 3' untranslated region) of a nucleic acid of interest. In 
some embodiments, the double stranded nucleic acid is less than 200, 150, 100, 75, 50, or 25 
nucleotides in length. In other embodiments, the double stranded nucleic acid is less than 

10 50,000; 10,000; 5,000; or 2,000 nucleotides in length. In certain embodiments, the double 
stranded nucleic acid is at least 200, 300, 500, 1000, or 5000 nucleotides in length. In some 
embodiments, the number of nucleotides in the double stranded nucleic acid is contained in one 
of the following ranges: 5-15 nucleotides, 16-20 nucleotides, 21-25 nucleotides, 26-35 
nucleotides, 36-45 nucleotides, 46-60 nucleotides, 61-80 nucleotides, 81-100 nucleotides, 101- 

15 150 nucleotides, or 151-200 nucleotides, inclusive. In addition, the double stranded nucleic acid 
may contain a sequence that is less than a full-length sequence or may contain a full-length 
sequence. 

In some embodiments, the double stranded nucleic acid inhibits the expression of only 
one nucleic acid, and in other embodiments, the double stranded nucleic acid molecule inhibits 

20 the expression of more than one nucleic acid. Desirably, the nucleic acid decreases the 

expression or biological activity of a nucleic acid of interest or a protein encoded by a nucleic 
acid of interest by at least 20, 40, 50, 60, 70, 80, 90, 95, or 100%. A double stranded nucleic 
acid can be introduced, e.g., to an individual cell or to whole animals, for example, it may be 
introduced systemically via the bloodstream. 

25 In various embodiments, the double stranded nucleic acid or antisense molecule includes 

one or more LNA nucleotides, one or more universal bases, and/or one or more modified 
nucleotides in which the T position in the sugar (e.g., ribose or xylose) contains a halogen (such 
as fluorine group) or contains an alkoxy group (such as a methoxy group) which increases the 
half-life of the double stranded nucleic acid or antisense molecule in vitro or in vivo compared to 

30 the corresponding double stranded nucleic acid or antisense molecule in which the corresponding 
2' position contains a hydrogen or an hydroxyl group. In yet other embodiments, the double 
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stranded nucleic acid or antisense molecule includes one or more linkages between adjacent 
nucleotides other than a naturally-occurring phosphodiester linkage. Examples of such linkages 
include phosphoramide, phosphorothioate, and phosphorodithioate linkages. Desirably, the 
double stranded or antisense molecule is purified. 
5 By "purified" is meant separated from other components that naturally accompany it. 

Typically, a factor is substantially pure when it is at least 50%, by weight, free from proteins, 
antibodies, and naturally-occurring organic molecules with which it is naturally associated. 
Desirably, the factor is at least 75%, more desirably, at least 90%, and most desirably, at least 
99%, by weight, pure. A substantially pure factor may be obtained by chemical synthesis, 

10 separation of the factor from natural sources, or production of the factor in a recombinant host 
cell that does not naturally produce the factor. Nucleic acids and proteins may be purified by one 
skilled in the art using standard techniques such as those described by Ausubel et ah (Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, 2000). The factor is desirably 
at least 2, 5, or 10 times as pure as the starting material, as measured using polyacrylamide gel 

15 electrophoresis, column chromatography, optical density, HPLC analysis, or western analysis 
(Ausubel et al, supra). Desirable methods of purification include immunoprecipitation, column 
chromatography such as immunoaffinity chromatography, magnetic bead immunoaffmity 
purification, and panning with a plate-bound antibody. 

By "treating, stabilizing, or preventing a disease, disorder, or condition" is meant 

20 preventing or delaying an initial or subsequent occurrence of a disease, disorder, or condition; 
increasing the disease-free survival time between the disappearance of a condition and its 
reoccurrence; stabilizing or reducing an adverse symptom associated with a condition; or 
inhibiting or stabilizing the progression of a condition. Desirably, at least 20, 40, 60, 80, 90, or 
95% of the treated subjects have a complete remission in which all evidence of the disease 

25 disappears. In another desirable embodiment, the length of time a patient survives after being 
diagnosed with a condition and treated with a nucleic acid of the invention is at least 20, 40, 60, 
80, 100, 200, or even 500% greater than (i) the average amount of time an untreated patient 
survives or (ii) the average amount of time a patient treated with another therapy survives. 

By "treating, stabilizing, or preventing cancer" is meant causing a reduction in the size of 

30 a tumor, slowing or preventing an increase in the size of a tumor, increasing the disease-free 

survival time between the disappearance of a tumor and its reappearance, preventing an initial or 
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subsequent occurrence of a tumor, or reducing an adverse symptom associated with a tumor. In 
one desirable embodiment, the number of cancerous cells surviving the treatment is at least 20, 
40, 60, 80, or 100% lower than the initial number of cancerous cells, as measured using any 
standard assay. Desirably, the decrease in the number of cancerous cells induced by 
5 administration of a nucleic acid of the invention (e.g., a nucleic acid with substantial 

complementarily to a nucleic acid associated with cancer such as an oncogene) is at least 2, 5, 
10, 20, or 50- fold greater than the decrease in the number of non-cancerous cells. In yet another 
desirable embodiment, the number of cancerous cells present after administration of a nucleic 
acid of the invention is at least 2, 5, 10, 20, or 50-fold lower than the number of cancerous cells 

10 present prior to the administration of the compound or after administration of a buffer control. 
Desirably, the methods of the present invention result in a decrease of 20, 40, 60, 80, or 100% in 
the size of a tumor as determined using standard methods. Desirably, at least 20, 40, 60, 80, 90, 
or 95% of the treated subjects have a complete remission in which all evidence of the cancer 
disappears. Desirably, the cancer does not reappear or reappears after at least 5, 10, 15, or 20 

15 years. 

Exemplary cancers that can be treated, stabilized, or prevented using the above methods 
include prostate cancers, breast cancers, ovarian cancers, pancreatic cancers, gastric cancers, 
bladder cancers, salivary gland carcinomas, gastrointestinal cancers, lung cancers, colon cancers, 
melanomas, brain tumors, leukemias, lymphomas, and carcinomas. Benign tumors may also be 

20 treated or prevented using the methods and nucleic acids of the present invention. 

By "infection" is meant the invasion of a host animal by a pathogen (e.g., a bacteria, 
yeast, or virus). For example, the infection may include the excessive growth of a pathogen that 
is normally present in or on the body of an animal or growth of a pathogen that is not normally 
present in or on the animal. More generally, an infection can be any situation in which the 

25 presence of a pathogen population(s) is damaging to a host. Thus, an animal is "suffering" fi-om 
an infection when an excessive amount of a pathogen population is present in or on the animal's 
body, or when the presence of a pathogen population(s) is damaging the cells or other tissue of 
the animal. In one embodiment, the number of a particular genus or species of pathogen is at 
least 2, 4, 6, or 8 times the number normally found in the animal. 

30 A bacterial infection may be due to gram positive and/or gram negative bacteria. In 

desirable embodiments, the bacterial infection is due to one or more of the following bacteria: 
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Chlamydophila pneumoniae, C, psittaci, C abortus^ Chlamydia trachomatis^ Simkania 
negevensis, Parachlamydia acanthamoebae, Pseudomonas aeruginosa, P. alcaligenes, P. 
chlororaphis, P.fluorescens,P. luteola^P. mendocina.P. monteilii, P. oryzihabitans.P. 
pertocinogena, P. pseudalcaligenes, P. putida, P. stutzeri, Burkholderia cepacia, Aeromonas 
5 hydrophilia, Escherichia coli, Citrobacter freundii. Salmonella typhimurium, S. typhi, S, 
paratyphi, S, enteritidis. Shigella dysenteriae, S, flexneri, 5. sonnei, Enterobacter cloacae, E. 
aerogenes, Klebsiella pneumoniae, K. oxytoca, Serratia marcescens, Francisella tularensis, 
Morganella morganii, Proteus mirabilis, Proteus vulgaris, Providencia alcalifaciens, P. rettgeri, 
P. stuartii, Acinetobacter calcoaceticus, A, haemolyticus, Yersinia enterocolitica, Y, pestis, Y, 

10 pseudotuberculosis, Y, intermedia, Bordetella pertussis, B, parapertussis, B, bronchiseptica, 

Haemophilus influenzae, H. parainfluenzae, H. haemolyticus, H. parahaemolyticus, H. ducreyi, 
Pasteurella multocida, P. haemolytica, Branhamella catarrhalis, Helicobacter pylori, 
Campylobacter fetus, C. jejuni, C. coli, Borrelia burgdorferi, V. cholerae, V. parahaemolyticus, 
Legionella pneumophila, Listeria monocytogenes, Neisseria gonorrhea, K meningitidis, 

1 5 Kingella dentrificans, K, kingae, K, oralis, Moraxella catarrhalis, M. atlantae, M, lacunata, M. 
nonliquefaciens, M. osloensis, M phenylpyruvica, Gardnerella vaginalis; Bacteroides fragilis, 
Bacteroides distasonis, Bacteroides 3452A homology group, Bacteroides vulgatus, B. ovalus, B, 
thetaiotaomicron, B, uniformis, B. eggerthii, B. splanchnicus, Clostridium difficile, 
Mycobacterium tuberculosis, M avium, M. intracellulare, M leprae, C. diphtheriae, C 

20 ulcerans, C. accolens, C. afermentans, C amycolatum, C. argentorense, C. auris, C. bovis, C. 
confusum, C coyleae, C. durum, C. falsenii, C. glucuronolyticum, C imitans, C. jeikeium, C 
kutscheri, C kroppenstedtii, C. lipophilum, C. macginleyi, C matruchoti, C mucifaciens, C. 
pilosum, C. propinquum, C renale, C riegelii, C sanguinis, C singulare, C striatum, C. 
sundsvallense, C. thomssenii, C urealyticum, C. xerosis , Streptococcus pneumoniae, S, 

25 agalactiae, S. pyogenes, Enterococcus avium, E. casseliflavus, E. cecorum, E. dispar, E. durans, 
E, faecalis, E. faecium, E, flavescens, E. gallinarum, E, hirae, E, malodoratus, E. mundtii, E. 
pseudoavium, E, raffinosus, E, solitarius. Staphylococcus aureus, S. epidermidis, S. 
saprophyticus, 5. intermedius, S. hyicus, S. haemolyticus, S. hominis, and/or iS. saccharolyticus. 
Desirably, a nucleic acid is administered in an amount sufficient to prevent, stabilize, or inhibit 

30 the growth of a pathogenic bacteria or to kill the bacteria. 
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In various embodiments, the viral infection relevant to the methods of the invention is an 
infection by one or more of the following viruses: West Nile virus (e.g., Samuel, "Host genetic 
variability and West Nile virus susceptibility," Proc. Natl. Acad. Sci. USA August 21, 2002; 
Beasley, Virology 296:17-23, 2002), Hepatitis, picomarirus, polio, HIV, coxsacchie, herpes 
5 (e.g., zoster, simplex, EBV, or CMV), adenovirus, retrovius, falvi, pox, rhabdovirus, picoma 
virus (e.g., coxsachie, entero, hoof and mouth, polio, or rhinovirus), St. Louis encephalitis, 
Epstein-Barr, myxovirus, JC, coxsakievirus B, togavirus, measles, paramyxovirus, echovirus, 
bunyavirus, cytomegalovirus, varicella-zoster, mumps, equine encephalitis, lymphocytic 
choriomeningitis, rabies, simian virus 40, polyoma virus, parvovirus, papilloma virus, primate 
10 adenovirus, and/or BK. 

By "mammal in need of treatment" is meant a mammal in which a disease, disorder, or 
condition is treated, stabilized, or prevented by the administration of a nucleic acid of the 
invention. 

Other aspects and embodiments of the invention are in the detailed description and claims 
15 below. Additionally, other nucleic acids and methods described in U.S. S.N. 10/105,639 
(Jakobsen et al, "Modified Oligonucleotides and Uses Thereof) or U.S.S.N. 60/410,061 
(Ramsing et ai, "Populations of Oligonucleotides with Duplex Stabilizing Properties and Uses 
Thereof) which are hereby incorporated by reference, can be used in the present invention. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the structures of selected nucleotide monomers: DNA (T), LNA (T*-), 
pyrene DNA (Py), 2'-0Me-RNA [2'-OMe(T)], abasic LNA (Ab^), phenyl LNA (17a), and 
pyrenylLNA (17d). 

Figure 2 illustrates the chemical structures of Selective Binding Complementary (SBC) 
25 nucleotides. 

Figure 3 shows the base-pairing between modified bases and naturally-occurring 
nucleotides. These modified bases may be incorporated as part of an LNA, DNA, or RNA unit 
and used any of the oligomers of the invention. 

Figure 4A and 4B show the sensitivity of 50-mer LNA capture probes compared to 50- 
30 mer DNA capture probes. SWl 5-specific 50-mer DNA oligonucleotides (green bars) and 50-mer 
capture probes with an LNA nucleotide incorporated at every third nucleotide position (red bars) 
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were printed at the oligo concentration indicated below. The sHdes were hybridized at 65°C in 
3x SSC (Fig. 4A) and at 70X in 3xSSC (Fig. 4B). 

Figures 5A and 5B show the specificity of 40-mer LNA capture probes (red bars) 
compared to DNA capture probes (green bars). The hybridizations were carried out at 65°C in 
5 3XSSC. Bars 1 and 7 represent perfectly matched duplexes, bars 2 and 8, 3, and 9, 4 and 10, 5 
and 1 1, and 6 and 12 represent duplexes with 1, 2, 3, 4, 5 mismatches, respectively. The in vitro 
RNA used was SW15 in Fig 5A and TH14 in Fig. 5B. 

Figure 6 shows the detection principle for alternative exon skipping in the C. elegans let- 
2 gene using LNA oligonucleotide capture probes and comparative expression profiling. 
10 Figure 7 shows the detection of alternative splicing of C elegans Let-2 exon 9 and 10 

using LNA-modified capture probes. 

Figures 8 A and 8B show the comparison of DNA and LNA-modified oligonucleotide 
capture probes in the specific capture of the C elegans T01D3.3 mRNA, exon 4. 

Figure 9 illustrates the LNA exon-exon junction (merged) probe concept. 
15 Figures lOA and lOB show the capture probe specificity for the C. elegans T01D3.3 

mRNA, exon 4 (Fig. lOA) and exon 5 (Fig. lOB) as validated by short complementary target 
oligonucleotides. 

Figure 1 1 shows the construction of the recombinant splice variants in the in vitro 
transcription vector. The small bars show the location of the hybridization for the 
20 oligonucleotide capture probes used in this example. The sequences of the capture probes are 
described herein. 

Figures 12A (LNA probes) and 12B (DNA control probes) show the detection of splice 
variant #1 and #2, respectively using merged capture probes in a comparative, two-color 
hybridization. 

25 Figure 13 shows the sensitivity of 50-mer LNA capture probes compared to 50-mer DNA 

capture probes. SWI5-specific 50-mer DNA oligonucleotides and 50-mer capture probes with an 
LNA nucleotide incorporated at every second (LNA2) or third (LNA3) nucleotide position. The 
slides were hybridized at 65°C in 3xSSC. 

Figure 14 is a bar graph of the signal intensities of a patient DNA sample hybridized to 

30 an array of the invention. The names of the probes in Figs. 14 and 17 match, although the 

numbers used in Fig. 14 are abbreviated, e.g., probe No. 10580 Menkes. 14 50NH2C6-2.LNA in 
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Fig. 17 corresponds to the second probe counted from the left "14.2" LNA in the lower graph of 
Fig. 14. 

Figure 15 is a graph comparing the spot intensity for probes of the invention with 
different LNA substitution patterns. 
5 Figure 16 is a bar graph of the spot intensity for LNA probes for different exons. 

Figure 17 is a table of comparative genome hybridization (CGH) capture probe 
sequences. 

Figure 18 is a flow chart of the steps of oligo design software of the invention. The 
OligoDesign software features LNA modified oligonucleotide secondary structure prediction, 

10 LNA spiked oligonucleotide melting temperature prediction, genome wide cross hybridization 
prediction, secondary structure prediction of the target, and recognition and filtering of the target 
in the genome. These features are determined for each possible probe of the query gene and 
presented to an artificial neural network. The probes are then ranked according to the neural 
network prediction and the top scoring probes are returned. 

15 Figures 19A-19F are a schematic illustration of the OligoDesign software of the 

invention. 

Figure 20 illustrates photo-activated immobilization of nucleic acids of the invention, 
which enables polarized coupling of anthraquinone (AQ)-linked LNA oligonucleotides onto the 
polymer surface. No pretreatment of the slide is needed. A covalent bond is formed between the 
20 oligonucleotide and the polymer using a UV source, e.g. Stratalinker. 

Figure 21 illustrates an injection-molded polymer slide. Finger indents ease slide 
handling. The slide has a well-defined printing and hybridization window, frosted surface for 
identification and orientation, and space for barcodes. 

Figure 22 illustrates spot quality on different slides that can be used to immobilize 
25 nucleic acids of the invention. The hydrophobic slide surface ensures that extremely 

homogenous spots are generated when hydrophilic spotting solution is applied to the surface. A 
high spot quality is obtained on the Immobilizer™ polymer slide compared to a glass slide when 
using a spot-to-spot distance of 150 jxM. The high-quality arrays simplify downstream image 
analyses. 

30 Figure 23 is a schematic illustration of a method of the invention. 

Figure 24 is a table of exemplary target nucleic acids (Holstege et al. (1998)( Cell 95, 
717-728, and Causton et al. (2001) Mol. Biol. Cell 12, 323-337). 
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Figure 25 is a graph of Cy5 intensity. Yeast actin 1 -specific 50-mer capture probes were 
synthesized as DNA and DNA/LNA mixmer oligonucleotides. LNA-substituted mixmer capture 
probes contain an LNA at every 4^^, 5^^, and 6**^ nucleotide position (LNA_4, LNA_5, LNA_6). 
On-chip melting profiles demonstrate a 8-10 °C increase in Tm obtained with LNA capture 
probes. 

Figure 26A illustrates the heat-shock response in yeast. The array was hybridized with 
Cy3-labeled standard and Cy5-labelled heat-shock yeast cDNA. Figure 26B also illustrates the 
heat-shock response in yeast. The microarray data were normalized using yeast actin 1 . The 
ssa4 gene encoding heat shock protein HSP70 is up-regulated over 2-fold. Expression of the 
gual gene is down-regulated. 

Figure 27 A compares expression of wild-type and ^-^a^ mutant yeast. The array was 
hybridized with Cy3-labeled wild-type and Cy5-labelled ssa4 mutant yeast cDNA. Figure 27B 
also compares wild-type and ssa4 yeast. The hybridization data were normalized using yeast 
actin 1. ssa4 is detected in the wild-type yeast strain, but not in the ssa4 knock-out strain. 

Figure 28 illustrates mRNA splicing. 

Figure 29 is a picture showing gel electrophoresis of fi*agmented cDNA fi'om the yeast 
wild-type strain. The molecular marker (lane 1 and 9) is from Life technologies, USA. Lanes 2- 
8 represents the UDG-fragmented cDNA 1-7 according to the different dUTP/dTTP ratios in 
Table 18. 

Figure 30 is a graph of the log ratios of the normalized fluorescence intensities fi'om the 
wild-type yeast strain (signal) and those from the A^sa^ yeast strain (noise) as a function of 
capture probe position in the 3' region of the SSA4 mRNA. 

Figure 31 is a schematic illustration of mRNA splicing. 

Figure 32 is a schematic illustration of alternative mRNA splicing. 

Figure 33 is a schematic illustration of probes of the invention. 

Figure 34 is a schematic illustration of probes of the invention. 

Figure 35 illustrates an exemplary computer for use in the methods of the invention. 

Figures 36a-36d show the sensitivity and specificity of LNA oligonucleotide capture 
probes (black solid bars) compared to DNA capture probes (white, open bars) on expression 
microarrays. Fluorescence intensity is shown in arbitrary units (relative measurements). The 
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arrays comprising 50-mer and 40-mer perfect match and 1-5 mismatch capture probes were 
hybridized at eS^'C in 3xSSC with Cy3-labelled cDNA from 10 C elegans total RNA spiked 
with yeast a) SWI5 RNA and c) THI4 RNA. Figures 36b and 36d demonstrate the improved 
mismatch discrimination with the 50-mer LNA probes by increasing the hybridization 
5 temperature from 65 "^C to 70 °C hybridized with Cy3-labelled cDNA from 10 ^ig C. elegans 
total RNA spiked with yeast b) SWI5 RNA and d) THI4 RNA. 

Figure 37 shows the expected (black, solid bars) and observed (white, open bars) fold-of- 
change in the expression levels of the Cy3-ULS-labelled yeast HSP78 spike RNA as measured 
by on-chip capture using three different 25-mer oligonucleotide capture probes (DNA control, 

10 LNA-T substituted, LNA_3 substituted in which every third nucleotide was substituted with an 
LNA monomer). In the hybridization experiment, one ng of HSP78 in vitro spike RNA or 200 pg 
HSP78 in vitro spike RNA was used, respectively. Thus, the fold change of the HSP78 RNA in 
the two hybridizations in the comparison is 5-fold. Fourteen additional synthetic in vitro mRNA 
spike controls were included in the hybridisation solution as a semi-complex background RNA 

15 mixture. Seven of these spikes were used as normalization controls, the remaining seven were 
used as negative controls. Hybridization temperature was 65°C for 16 hours, and post- 
hybridization washes as described. Both LNA_T and LNA_3 substituted 25-mer probes are 
capable of providing highly accurate measurements for fold-of-changes in gene expression 
levels, as depicted in Figure 37. Under these conditions the DNA capture probes did not 

20 hybridize. 

Figure 38 shows the measured intensity levels by on-chip capture using three different 
25-mer oligonucleotide capture probe designs ( DNA control, LNA T substituted and LNA C 
and T substituted probes). One (1) ng biotin-labeled HSP78 target was used in the hybridization 
experiments, followed by staining with Streptavidin Phycoerythrin. The LNA_T and LNA_TC 

25 substituted 25-mer capture probes show a significantly enhanced on-chip capture of the HSP78 
RNA target, compared to the DNA 25-mer control probes under four different hybridization 
stringency conditions in dicated on the graph. 

Figure 39 shows the detection of alternatively spliced mRNAs using LNA-substituted 50- 
mer oligonucleotide capture probes. Parts per million (ppm) calculations indicate spike 

30 transcripts per total transcripts in the hybridisation mix. Calculations are based on an average C 
elegans RNA being 1000 nucleotides as in Hill et al, (2000) Science 290:809-812. The 50-mer 
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LNA-DNA mixmer capture probes, substituted with an LNA nucleotide at every third nucleotide 
position, are able to provide highly accurate measurements for fold-changes in the expression of 
three homologous, alternatively spliced mRNA variants in the concentration range of 1000 ppm 
to 10 ppm. The quantification of the splice isoforms was carried out using a set of both internal, 
5 exon-specific probes and merged, splice junction specific probes, printed onto microarrays and 
hybridized with complex cDNA target pools spiked with different cloned artificial splice 
isoforms in which the middle exon was either alternatively skipped or excluded completely 
resulting in the three different splice isoforms; 01-INS3-03, 01-INS4-03 and 01-03. 

Figure 40 shows the detection of alternatively spliced mRNAs using LNA-substituted 40- 

10 mer oligonucleotide capture probes. Parts per million (ppm) calculations indicate spike 

transcripts per total transcripts in the hybridisation mix. Calculations are based on an average C. 
elegans RNA being 1000 nucleotides as in Hill et al (2000) Science 290:809-812. The 40-mer 
LNA-DNA mixmer capture probes, substituted with an LNA nucleotide at every third nucleotide 
position, are able to provide highly accurate measurements for fold-changes in the expression of 

1 5 three homologous, alternatively spliced mRNA variants in the concentration range of 1000 ppm 
to 10 ppm. The quantification of the splice isoforms was carried out using a set of both internal, 
exon-specific probes and merged, splice junction specific probes, printed onto microarrays and 
hybridized with complex cDNA target pools spiked with different cloned artificial splice 
isoforms in which the middle exon was either alternatively skipped or excluded completely 

20 resulting in the three different splice isoforms; 01-INS3-03, 01-INS4-03 and 01-03. 

Figures 41 A-41E show the comparison of different LNA/DNA mixmer oligonucleotide 
probes in the detection of human satellite-2 repeats by fluorescence in situ hybridization. 
Experiment conditions: 6.4 pmoles of Cy3 labeled probe was hybridized for 30 minutes at 37°C, 
after simultaneous denaturation of the target and the probe at 75^C for 5 minutes. A. LNA-2 

25 giving signals on chromosomes 1, 16, 9 and 15, B. LNA-3 giving bright signals on chromosomes 
1, 16 and 9, C. Dispersed LNA giving signals on chromosomes 1 and 16 only, D. LNA Block 
giving smaller signals on chromosome 1, E. DNA control oligonucleotide FISH probe giving no 
signals on any of the chromosomes. 

Figure 42. Illustrates the hybridisation of the Cy3-labelled human telomere repeat 

30 specific, LNA-2 substituted oligonucleotide probe on human metaphase chromosomes resulted 
in prominent signals on the telomeres. 
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DETAILED DESCRIPTION OF THE INVENTION 

Detection and Analysis of mRNA Splice Variants 

Alternative splicing is the process by which different mature messenger RNAs are 
5 produced from the same pre-mRNA. Because the mRNA composition of a given cell determines 
the proteins present in a cell, this process is an important aspect of a cells gene expression 
profile. Current investigations of transcriptomes (i.e., the total complexity of RNA transcripts 
produced by an organism) indicate that at least 50-60 % of the genes of complex eukaryotes 
produce more than one splice variant. The present invention provides a novel method for 

10 detecting and quantifying the levels of splice variants in complex mRNA pools using LNA 
discriminating probes and high-throughput LNA oligonucleotide microarray technology. The 
detection concept which uses internal LNA exon probes and/or splice-variant specific exon-exon 
junction or exon-intron or intron-exon (so-called merged) probes is depicted in Fig. 9. 

Intemal, exon-specific (or intron-specific) LNA oligonucleotide probes are designed and 

1 5 used to detect the relative levels of a given exon (or intron) in complex mRNA pools using 

oligonucleotide microarray technology or similar techniques. Exon-exon LNA junction probes 
are designed for multiple or all possible exon-exon combinations or exon-intron combinations. 
The LNA discriminating probes are highly specific and superior compared to DNA 
oligonucleotides due to the higher ATm of LNA probes. These probes can be used to determine 

20 the sequential order of each sub-element (i.e., exon structure or exon-intron structure) in a given 
alternatively spliced mRNA isoform, thus giving the exact composition of the mRNA. 
Subsequently, the ratios of each splice variant can be quantified using the combined readouts 
fi*om both intemal and merged LNA probes and control probes. The invention is applicable both 
in single fluor (single channel) or comparative two-fluor (two channel) microarray 

25 hybridizations. 

Several "artificial,"altematively spliced mRNA molecules may be constructed in an in 
vitro transcription vector for the production of clean IVT RNA. Both intemal and junction- 
specific LNA oligonucleotide capture probes are designed, synthesized, and spotted onto, e.g., 
Exiqon's polymer microarray platform. The resulting splice-specific microarray is used to 
30 validate the LNA discriminating probe concept by spiking the in vitro RNAs individually as well 
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as in different ratios into a complex RNA background for fluorochrome-labelling and array 
hybridization. 

The internal and merged probes of the invention can also be used in any standard method 
for the analysis of mRNA splice variants (see, for example, Yeakley et al. Nature Biotechnology 
5 20:353-358, 2002; Clark et al. Science 296:907-910, 2002; Mutch et al. Genome Biology 
2(1 2):preprint00009. 1 -0009.3 1 , 200 1 ). 

Exemplary Applications of Internal and/or Merged Probes 

The internal and/or merged probes of the invention can also be used for gene expression 

10 profiling of alternative splice variants, oligonucleotide expression microarrays, real-time PCR, 
and profiling of alternatively spliced mRNAs using microtiterplate assays or fiber-optic arrays. 

Detection and characterization of alternative splicing is particularly useful for the study 
and treatment of human disease (exonhit website, "Inaugural Splicing 2002 Concludes: 
Altemative Splicing May Make All the Difference in Discovering the Origin of Disease). In 

15 particular, RNA splicing is now widely recognized as a means to generate protein diversity. 
Altemative splicing is a key mechanism for regulating gene expression, and any mutation or 
defect in its regulation can impact considerably cell functions. Therefore, it is likely to be an 
important source of novel gene and protein targets implicated in human pathology. Industry has 
long recognized the need for innovative discovery technologies that focuses on the origin of 

20 disease for the development of novel diagnostics and therapeutics. 

In particular, there are many examples of human pathologies caused by alterations in 
normal patterns of altemative RNA splicing. Because a large number of human genes undergo 
altemative splicing, the protein isoforms that result from this process represent a major source of 
targets for commercial development of therapies and diagnostics. In particular, splicing 

25 processes play a significant role in the onset and development of cardiovascular, muscular, CNS 
diseases, and cancer. Early evidence indicates the origin of many diseases can be identified by 
examining altemative splicing - which leads to the point of intervention for discovering future 
generations of drugs. The study of splicing enables the discovery of new mechanisms 
underlying disease progression. 

30 Comparative Genomic Hybridization 
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Comparative Genomic Hybridization (CGH) is a powerful technology for detection of 
unbalanced chromosome rearrangements and holds much promise for screening and 
identification of interstitial submicroscopic rearrangements that otherwise cannot be detected 
using classical cytogenetic or FISH technologies. The adaptation of CGH onto an oligo 
5 microarray platform allows detection of small single exon deletion/duplications on a genome 
wide scale. There is a strong need for developing microarrays that can detect, e.g., single exon 
aberrations. This detection can be achieved by employing LNA mixmer oligos as capture probes 
for individual exons in selected genes. 

A model system for these methods is the Menkes loci. Menkes disease is a lethal-X linked 

10 recessive disorder associated with copper metabolism disturbance leading to death in early 

childhood. The Menkes locus has been mapped to Xql3. The gene spans about 150 kb genomic 
region, contains 23 exons, and encodes a 8.5 kb gene transcript. The gene for Menkes disease 
(now designated disATP7A) encodes a 1500 amino acid membrane-bound Cu-binding P-type 
ATPase (ATP7A). The 8.5 kb transcript is expressed in all tissues from normal individuals 

1 5 (though only trace amounts are present in liver), but is diminished or absent in Menkes disease 
patients. Several different kinds of mutations, like chromosome aberrations, point mutations and 
partial gene deletions affecting .4 TP 7^4 have been identified in MD patients. 50-mer capture 
probes with LNA spiked in every second, third, and fourth position have been designed for every 
exon (23 exons) representing ^4 TP 7/4, using the OligoDesign software tool, described herein. 

20 The C6-amino-linked capture probes were spotted onto hnmobilizer slides and hybridized with 
patient samples with Cy3 fluorescent dye and a known reference genomic sample with Cy5. 
After mixing equal amounts of the labelled DNA, the probe is hybridized it to array. The ratio 
of Cy5 signal to Cy3 for each clone indicates differences in chromosome/DNA material. For 
example, the Cy5 signal is higher than Cy3 if the patient genome has a deletion, and is lower if 

25 there is duplication. In regions that are unchanged, the Cy5:Cy3 ratio is 1:1. These methods can 
be used to analyze a number of well-characterized Menkes patients with a range of partial 
deletions of^7P7^. 

LNA oligonucleotide-based CGH makes it possible to assess a large number of 
chromosomal aberrations that are being screen for in the cytogenetic clinic. In contrast, standard 

30 FISH analysis typically only detects large chromosomal rearrangements. In desirable 
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embodiments, an array that contains a series of overlapping probes is used to detect a 
chromosomal deletion in a nucleic acid sample, such as a patient sample. 
Clinical diagnostics 

Clinical diagnosis is a key element in healthcare management and point-of-care. A large 
5 number of analyses in the hospitals are based on the use of robust, cost efficient, sensitive and 
highly specific diagnostic tests. Thus, the diagnosis of various diseases is performed with a high 
selectivity and reliability, resulting in confirmation of medical diagnosis, choice of therapy and 
follow-up treatment as well as prevention. In addition to its importance in the quality of 
healthcare provided to patients, clinical diagnosis also contributes to the control of healthcare 

10 costs. The field of clinical diagnostics involves analyzing biological fluid samples (blood, urine, 
etc.) or biopsies collected from patients in order to establish the diagnosis of diseases, whether of 
infectious, metabolic, endocrine or cancerous origin. Medical analysis of infectious diseases 
involves testing and identifying the micro-organisms causing the infection e.g. testing for and 
identifying a micro-organism in blood and determining its susceptibility to antibiotics or 

15 detecting an antigen-antibody reaction produced as a response to an attack by a micro-organism 
in the human body, e.g. testing for antibodies for the diagnosis of hepatitis. The accurate 
diagnosis of metabolic and endocrine diseases and cancers, resulting in a disease phenotype with 
a bodily imbalance, involves the measurement of diagnostic substances or elements present in 
the biological fluids or biopsies. These substances are examined and results are interpreted with 

20 reference to known normal values. 

Use of diagnostic kits in microbiological control 

The pharmaceutical, cosmetics and agri-food industries are being confronted with 
increasingly strict quality standards. Thus, the purpose of industrial microbiological control 
testing is to detect and measure the presence of potentially pathogenic microbial contaminants 

25 throughout the manufacturing process from raw materials to the finished products, as well as in 
the production environment. The obtained results are subsequently compared to the current 
regulatory guidelines and industry standards. 

Application of molecular biological techniques to in vitro diagnostics 

Recently, several different molecular biological techniques have been used successfiilly 
30 in accurate quantification of RNA levels in clinical diagnosis as well as in microbiological 
control. The applications are wide-ranging and include methods for quantification of the 
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regulation and expression of drug resistance markers in tumour cells, monitoring of the responses 
to chemotherapy, measuring the biodistribution and transcription of gene-encoded therapeutics, 
molecular assessment of the tumor stage in a given cancer, detecting circulating tumor cells in 
cancer patients and detection of bacterial and viral pathogens. The reverse transcription 
5 polymerase chain reaction (RT-PCR) is the most sensitive method for the detection of mRNA, 
including low abundant mRNAs, often obtained from limited tissue samples in clinical 
diagnostics. The application of fluorescence techniques to RT-PCR combined with suitable 
instrumentation has led to development of quantitative RT-PCR methods, combining 
amplification, detection and quantification in a closed system avoid from contamination and with 

10 minimized hands-on time. The two most commonly used quantitative RT-PCR techniques are 
the Taqman RT-PCR assay (ABI, Foster City, USA) and the Lightcycler assay (Roche, USA). A 
third method applied to detection and quantification of RNA levels is real-time nucleic acid 
sequence based amplification (NASBA) combined with molecular beacon detection molecules. 
NASBA is a singe-step isothermal RNA-specific amplification method that amplifies mRNA in a 

1 5 double stranded DNA environment, and this method has recently proven usefiil in the detection 
of various mRNAs and in the detection of both viral and bacterial RNA in clinical samples. 
Finally, the recent explosion in microarray technology holds the promise of using microarrays in 
clinical diagnostics. For example van't Veer et al. (Nature 2002: 415, 31) describe the successful 
use of microarrays in obtaining digital mRNA signatures from breast tumors and the use of these 

20 signatures in the precise prediction of the clinical outcome of breast cancer in patients. 

The success of exploiting molecular biological techniques in diagnostics and diagnostic 
kits depends on continuous optimization of the technologies and the development of new robust 
and cost-effective technology platforms for producing accurate, reproducible and valid clinical 
data. Locked nucleic acid (LNA) oligonucleotides constitute a novel class of bicyclic RNA 

25 analogs having an exceptionally high affinity and specificity toward their complementary DNA 
and RNA target molecules. Besides increased thermal stability, LNA-containing 
oligonucleotides show significantly increased mismatch discrimination, and allow full control of 
the melting temperature across microarray hybridizations. The LNA chemistry is completely 
compatible with conventional DNA phosphoramidite chemistry and thus LNA substituted 

30 oligonucleotides can be designed to optimize performance. LNA oligonucleotides would be well- 
suited for large-scale clinical studies providing highly accurate genotyping by direct competitive 
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hybridization of two allele-specific LNA probes to e.g. microarrays of immobilized patient 
amplicons. In addition, the use of LNA substituted oligonucleotides would increase both 
sensitivity and specificity in detection and quantification of mRNA levels in clinical samples, 
either by quantitative RT-PCR, quantitative NASBA or oligonucleotide microarrays, compared 
5 with DNA probes. Application of LNA oligonucleotides into diagnostic kits would thus 

significantly enhance their performance. Finally, the use of LNA substituted oligonucleotides 
would increase the sensititity and specificity in the detection of alternatively spliced mRNA 
isoforms and non-coding RNAs either by homogeneous assays (Taqman assay, Lightcycler 
assay, NASBA) or by oligonucleotide microarrays in a massive parallel analysis setup. 

1 0 Optimized Nucleic Acids of the Invention 

Decreasing the variation in melting temperatures (Tm) of a population of nucleic acids 
allows the nucleic acids to hybridize to target molecules under similar binding conditions, 
thereby simplifying the simultaneous hybridization of multiple nucleic acids. Similar melting 
temperatures also allow the same hybridization conditions to be used for multiple experiments, 

15 which is particularly useful for assays involving hybridization to nucleic acids of varying "AT" 
content. For example, current methods often require less stringent conditions for hybridization 
of nucleic acids with high "AT" content compared to nucleic acids with low "AT" content. Due 
to this variation in hybridization stringency, current methods may require significant trial and 
error to optimize the hybridization conditions for each experiment. 

20 To overcome limitations in current nucleic acid hybridization and/or amplification 

techniques, we have developed populations of nucleic acid probes or primers with minimal 
variation in melting temperature (U.S. S.N. 60/410,061). For example, the unique properties of 
LNA nucleotide analogs increase their binding affinity for DNA and RNA. The stability of 
duplexes can generally be ranked as follows: DNA:DNA < DNAiRNA < RNAiRNA < 

25 LNA:DNA < LNA:RNA < LNA:LNA. The DNA:DNA duplex is thus the least stable and the 
LNA:LNA duplex the most stable. The affinity of the LNA nucleotides A and T corresponds 
approximately to the affinity of DNA G and C to their complementary bases. General 
substitution of one or more A and T nucleotides with LNA A and LNA T in DNA 
oligonucleotides is therefore a simple way of equalizing differences in Tm. Furthermore, the 

30 mean melting temperature is increased significantly, which is often important for shorter 
oligonucleotides. For example, predictions of melting temperature of all possible 9-mer 
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oligonucleotides have shown that the mean temperature increases from 39.7 to 59.3 °C by 
substituting all DNA A and T nucleotides with LNA A and T nucleotides. The variance in Tm of 
all 9-mers furthermore decreases from 59.6 °C for DNA oligonucleotides to only 4.7 °C for the 
LNA substituted oligonucleotides. The estimations are based on the latest LNA T^ prediction 
5 algorithms such as those disclosed herein, which have a variance of 6-7 

If desired, the capture efficiency of one or more nucleic acids can be increased by 
including any of the high affinity nucleotides (e.g., LNA units) described herein within the 
nucleic acids. The examples herein also provide algorithms for optimizing the substitution 
patterns of the nucleic acids to minimize self-complementarity that may otherwise inhibit the 

10 binding of the nucleic acids to target molecules. 

For various applications of the nucleic acids and arrays of the invention, LNA A and 
LNA T substitutions are made to equalize the melting temperatures of the nucleic acids. In other 
embodiments, LNA A and LNA C substitutions are made to minimize self-complementarity and 
to increase specificity. LNA C and LNA T substitutions also minimize self-complementarity. 

15 Additionally, oligonucleotides containing LNA C and LNA T are desirable because these 
modified nucleotides are easy to synthesis and are especially useful for applications such as 
antisense technology in which minimizing cost is especially desirable. 

The following non-limiting examples are illustrative of the invention. All documents 
mentioned herein are incorporated herein by reference in their entirety. In the following 

20 Examples, compound reference numbers designate the compound as shown in Scheme 1 and 2 
herein. 

Example 1: The Use of LNA-inodified Oligonucleotides in Microarrays Provide 
Significantly Improved Sensitivity and Specificity in Expression Profiling 

25 This example demonstrates the advantages of using LNA oligonucleotide microarrays in 

gene expression profiling experiments. Capture probes for the Saccharomyces cerevisiae genes 
SWI5 (YDR146C) and THI4 (YGR144W) were designed as 50-mer standard DNA and different 
LNA/DNA "mixmer" oligonucleotides (i.e., oligonucleotides containing both LNA and DNA 
nucleotides) respectively, for comparison (Table 2). In addition, 40-mer oligonucleotides were 

30 designed as truncated versions of the 50-mer capture probes (Table 2). The specificity of the 
LNA oligoarrays was addressed by introducing 1-5 consecutive mismatches positioned in the 
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middle of 40-mer LNA/DNA mixmer capture probes with LNA in every fourth position. To 
assess the sensitivity of DNA versus LNA capture probes in complex hybridization mixtures, in 
vitro synthesized yeast RNA for either SWI5 or THI4 was spiked into Caenorhabditis elegans 
total RNA for cDNA target synthesis. These experiments are described further below. 
5 Cultivation of Caenorhabditis elegans worms 

Mixed stage C. elegans cultures were grown according to standard methods. Samples 
were harvested by centrifugation at S^OOOxg, suspended in RNA Later storage buffer (Ambion, 
USA), and immediately frozen in liquid nitrogen. 
RNA extraction 

10 RNA was extracted from the worm samples using the FastRNA® Kit, GREEN (Q-BIO) 

essentially according to the suppliers' instructions. 
In vitro RNA synthesis 

Amplification of the yeast genes was performed using standard PGR with yeast genomic 
DNA as the template. In the first step, a forward primer containing a restriction enzyme site and 

15 a reverse primer containing a universal linker sequence were used. In the second PGR reaction, 
the reverse primer was exchanged with a nested primer containing a poly-T20 tail and a 
restriction enzyme site. The DNA fragments were ligated into the pTRIampl 8 vector (Ambion, 
USA) using the Quick Ligation Kit (New England Biolabs, USA) according to the supplier's 
instructions and transformed into E. coli DH-5a by standard methods. The PGR clones were 

20 sequenced using Ml 3 forward and Ml 3 reverse primers on an ABI 377 (Applied Biosystems, 
USA). Synthesis of in vitro RNA was carried out using the MEGAscript™ T7 Kit (Ambion, 
USA) according to the manufacturer's instructions. 
Design and synthesis of the LNA capture probes 

To design the capture probes, regions with unique mRNA sequence of the selected target 

25 genes were identified. The optimal 50-mer oligonucleotide sequences with respect to Tm, self- 
complementarity, and secondary structure were selected. LNA modifications were incorporated 
to increase affinity and specificity. 
Printing of the LNA microarrays 

The microarrays were printed on Immobilizer™ MicroArray Slides (Exiqon, Denmark) 

30 using the Biochip One Arrayer fi-om Packard Biochip technologies (Packard, USA). The arrays 
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were printed with a spot volume of 2 x 300 pi of a 10 |LiM capture probe solution. Four replicas 

of the capture probes were printed on each slide. 

Synthesis of fluorochrome labelled first strand cDNA fi-om total RNA 

Ten ng of S. cerevisiae in vitro synthesized RNA (either SWI5 or THI4) was combined 
5 with 10 ^ig of C. elegans total RNA and 5 |ig oligo dT primer (T20VN) in an RNase free, pre- 
siliconized 1 .5 mL tube, and the final volume was adjusted with DEPC-water to 8 |LiL. The 
reaction mixture was heated at +70°C for 10 minutes, quenched on ice for 5 minutes, and spun 
for 20 seconds, followed by addition of 1 |iL SUPERase-In™ (20U/)LiL, RNAse inhibitor, 
Ambion, USA), 4 |iL SxRTase buffer (Invitrogen, USA), 2 ^iL O.IM DTT (Invitrogen, USA), 1 

10 |iL dNTP (20mM dATP, dGTP, dTTP; 0.4 mM dCTP in DEPC-water, Amersham Pharmacia 
Biotech, USA), and 3 |LiL Cy3™-dCTP or Cy5™-dCTP (Amersham Pharmacia Biotech, USA). 
First strand cDNA synthesis was carried out by adding 1 \xL of Superscript™ II (Invitrogen, 200 
U/mL), mixing, and incubating the reaction mixture for one hour at 42''C. An additional 1 |iL of 
Superscript™ II was added, and the cDNA synthesis reaction mixture was incubated for an 

15 additional one hour at 42°C; the reaction was stopped by heating at IQ^C for 5 minutes, and 

quenching on ice for 2 minutes. The RNA was hydrolyzed by adding 3 |iL of 0.5 M NaOH, and 
incubating at 70°C for 15 minutes. The samples were neutralized by adding 3 p.L of 0.5 M HCl, 
and purified by adding 450 ^iL IxTE buffer, pH 7.5 to the neutralized sample and transferring 
the samples onto a Microcon-30 concentrator. The samples were centrifuged at 14000xg in a 

20 microcentrifuge for ~8 minutes, the flow-through was discarded and the washing step was 

repeated twice by refilling the filter with 450 nl IxTE buffer and by spinning for -12 minutes. 
Centrifiigation was continued until the volume was reduced to 5 (iL, and finally the labelled 
cDNA probe was eluted by inverting the Microcon-30 tube and spinning at lOOOxg for 3 
minutes. 

25 Hybridization with fluorochrome-labelled cDNA 

The arrays were hybridized overnight using the following protocol. The Cy3™ or Cy5™- 
labelled cDNA samples were combined in one tube followed by addition of 3 |iL 20xSSC 
(3xSSC final), 0.5 ^iL 1 M HEPES, pH 7.0 (25 mM final), 25 |ig yeast tRNA (1.25 ^ig/^iL final), 
10 Jig PolyA blocker (0.5 \i§J\ih final), 0.6 |iL 10% SDS (0.3% final), and DEPC-treated water 

30 to 20 iLiL final volume. The labelled cDNA target sample was filtered in a Millipore 0.22 micron 


spin column according to the manufacturer's instructions (Millipore, USA), and the probe was 
denatured by incubating the reaction at 100°C for 2 minutes. The sample was cooled at 20-25°C 
for 5 minutes by spinning at maximum speed in a microcentrifuge. A LifterSlip (Erie Scientific 
Company, USA) was carefully placed on top of the microarray spotted on Immobilizer™ 
5 MicroArray Slide, and the hybridization mixture was applied to the array from the side. An 
aliquot of 30 [iL of 3xSSC was added to both ends of the hybridization chamber, and the 
Immobilizer™ MicroArray Slide was placed in the hybridization chamber. The chamber was 
sealed watertight and incubated at 65°C for 16-18 hours submerged in a water bath. After 
hybridization, the slide was removed carefully from the hybridization chamber and washed using 

10 the following protocol. The Lifterslip coverslip was washed off in 2 x SSC, pH 7.0 containing 
0.1% SDS at room temperature for one minute, followed by washing of the microarrays 
subsequently in 1.0 x SSC, pH 7.0 at room temperature for one minute, and then in 0.2 x SSC, 
pH 7.0 at room temperature for one minute. Finally, the slides were washed for 5 seconds in 
0.05 X SSC, pH 7.0. The slides were then dried by centrifugation in a swinging bucket rotor at 

15 approximately 600 rpm for 5 minutes. 
Data analysis 

Following washing and drying, the slides were scanned using a ScanArray 4000XL 
scanner (Perkin-Elmer Life Sciences, USA), and the array data were processed using the 
GenePix™ Pro 4.0 software package (Axon, USA). 
20 Results 

Incorporation of LNA nucleotides at every third nucleotide position in standard 50-mer 
expression array oligonucleoitde capture probes resulted in a 3-fold increase in fluorescence 
intensity levels, when hybridized under standard stringency conditions (Figs. 4A and 4B). When 
the hybridization temperature is increased from 65 °C to 70 "^C, the capture of the SWI5 spike 

25 mRNA by LNA 50-mer oligos is increased by 8-fold relative to the DNA controls. Thus, it can 
clearly be concluded that oligonucleotides containing LNA units are more sensitive in expression 
profiling compared to oligonucleotides containing only DNA units. The specificity of 40-mer 
LNA/DNA mixmer capture probes in the discrimination of highly homologous target sequences 
was addressed by introducing 1 -5 consecutive mismatches in the middle of SWI5 and THI4 

30 capture oligos together with the corresponding DNA controls. As demonstrated in Figs. 5A and 
5B, the LNA-spiked (LNA modification at every fourth nucleotide position) 40-mer triple 
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mismatch oligos showed a 3-fold signal intensity decrease relative to the perfectly matched 
duplexes, whereas the corresponding 40-mer standard DNA capture probes did not form 
duplexes under standard hybridization stringency. Further, the 40-mer perfect match LNA 
capture probes showed a 5-fold to 14-fold increase in the intensity levels compared to DNA 
5 oligonucleotides under standard hybridization conditions. Capture probes of other lengths and/or 
with other LNA substitution patterns can be used similarly. 

Table 2. DNA and LNA-modified SWI5 (YDR146C) and THI4 (YGR144W) oligonucleotide 
capture probes. LNA modifications are depicted by uppercase letters in the sequence, mt denotes 
10 the number of mismatches (bolded) in the center of the oligonucleotide with respect to its target 
cDNA (mRNA), and "mC" denotes LNA methyl cytosine. (SEQ ID NOs: ) 


Oligo Name 



Sequence 

YDR146C-40 



acggggattatggtttcgccaatgaaaactaatcaaaggt 

YDR146C-40_ 

_mt1 


acggggattatggtttcgcctatgaaaactaatcaaaggt 

YDR146C-40_ 

_mt2 


acggggattatggtttcgcgtatgaaaactaatcaaaggt 

YDR146C-40_ 

_mt3 


acggggattatggtttcgggtatgaaaactaatcaaaggt 

YDR146C-40_ 

_mt4 


acggggattatggtttcgggtttgaaaactaatcaaaggt 

YDR146C-40_ 

_mt5 


acggggattatggtttcgggttagaaaactaatcaaaggt 

YDR146C-40_ 

_LNA4 


acGgggAttaTggtTtcgmCcaaTgaaAactAatcAaagGt 

YDR146C-40_ 

_LNA4_ 

^mt1 

acGgggAttaTggtTtcgmCctaTgaaAactAatcAaagGt 

YDR146C-40_ 

_LNA4_ 

_mt2 

acGgggAttaTggtTtcgmCgtaTgaaAactAatcAaagGt 

YDR146C-40_ 

_LNA4_ 

_mt3 

acGgggAttaTggtTtcgGgtaTgaaAactAatcAaagGt 

YDR146C-40_ 

_LNA4_ 

_mt4 

acGgggAttaTggtTtcgGgttTgaaAactAatcAaagGt 

YDR146C-40, 

_LNA4. 

_mt5 

acGgggAttaTggtltcgGgttAgaaAactAatcAaagGt 

YDR146C-50 



tgggaatggaacggggattatggtttcgccaatgaaaactaatcaaaggt 

YDR146C-50, 

_mt1 


tgggaatggaacggggattatggtatcgccaatgaaaactaatcaaaggt 

YDR146C-50_ 

_mt2 


tgggaatggaacggggattatggtaacgccaatgaaaactaatcaaaggt 

YDR146C-50. 

_mt3 


tgggaatggaacggggattatggtaaggccaatgaaaactaatcaaaggt 

YDR146C-50^ 

_mt4 


tgggaatggaacggggattatggaaaggccaatgaaaactaatcaaaggt 

YDR146C-50_ 

_mt5 


tgggaatggaacggggattatggaaagcccaatgaaaactaatcaaaggt 

YDR146C-50_ 

_LNA2 


TgGgAaTgGaAcGgGgAtTaTgGtTtmCgmCcAaTgAaAamCtAaTcAaAgGt 

YDR146C-50_ 

_LNA3 


TggGaaTggAacGggGatTatGgtTtcGccAatGaaAacTaaTcaAagGt 

YDR146C-50_ 

_LNA4 


TgggAatgGaacGgggAttaTggtTtcgmCcaaTgaaAactAatcAaagGt 

YDR146C-50, 

_LNA5 


TgggaAtggaAcgggGattaTggttTcgccAatgaAaactAatcaAaggt 
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Oligo Name 



Sequence 

YDR146C-50_ 

LNA6 


TgggaaTggaacGgggatTatggtTtcgccAatgaaAactaaTcaaagGt 

YDR146C-50_ 

LNA3_ 

mtl 

TggGaaTggAacGggGatTatGgtAtcGccAatGaaAacTaaTcaAagGt 

YDR146C-50_ 

LNA3_ 

,mt2 

TggGaaTggAacGggGatlatGgtAacGccAatGaaAacTaaTcaAagGt 

YDR146C-50_ 

LNA3_ 

.mt3 

TggGaaTggAacGggGatTatGgtAagGccAatGaaAacTaaTcaAagGt 

YDR146C-50_ 

LNA3_ 

mt4 

TggGaaTggAacGggGatTatGgaAagGccAatGaaAacTaaTcaAagGt 

YDR146C-50_ 

LNA3_ 

.mt5 

TggGaaTggAacGggGatTatGgaAagmCccAatGaaAacTaaTcaAagGt 

YGR144W-40 



ttgctgaactggatggattaaaccgtatgggtccaacttt 

YGR144W-40. 

_mt1 


ttgctgaactggatggatttaaccgtatgggtccaacttt 

YGR144W-40. 

_mt2 


ttgctgaactggatggatataaccgtatgggtccaacttt 

YGR144W-40. 

_mt3 


ttgctgaactggatggatattaccgtatgggtccaacttt 

YGR144W-40. 

_mt4 


ttgctgaactggatggatatttccgtatgggtccaacttt 

YGR144W-40. 

_mt5 


ttgctgaactggatggatatttgcgtatgggtccaacttt 

YGR144W-40. 

_LNA4 


ttGctgAactGgatGgatTaaamCcgtAtggGtccAactTt 

YGR144W-40. 

_LNA4 

_mt1 

ttGctgAactGgatGgatTtaamCcgtAtggGtccAactTt 

YGR144W-40. 

_LNA4 

_mt2 

UGctgAactGgatGgatAtaamCcgtAtggGtccAactTt 

YGR144W-40. 

_LNA4 

_mt3 

ttGctgAactGgatGgatAttamCcgtAtggGtccAactTt 

YGR144W-40. 

_LNA4 

_mt4 

ItGctgAactGgatGgatAtttmCcgtAtggGtccAactTt 

YGR144W-40. 

_LNA4 

_mt5 

ttGctgAactGgatGgatAtttGcgtAtggGtccAactTt 

YGR144W-50 



ggtatggaagttgctgaactggatggattaaaccgtatgggtccaacttt 

YGR144W-50. 

_mt1 


ggtatggaagttgctgaactggatcgattaaaccgtatgggtccaacttt 

YGR144W-50. 

_mt2 


ggtatggaagttgctgaactggatccattaaaccgtatgggtccaacttt 

YGR144W-50. 

_mt3 


ggtatggaagttgctgaactggaaccattaaaccgtatgggtccaacttt 

YGR144W-50. 

_mt4 


ggtatggaagttgctgaactggaacctttaaaccgtatgggtccaacttt 

YGR144W-50. 

_mt5 


ggtatggaagttgctgaactggaacctataaaccgtatgggtccaacttt 

YGR144W-50. 

_LNA3 


GgtAtgGaaGttGctGaamCtgGatGgaTtaAacmCgtAtgGgtmCcaActTt 

YGR144W-50. 

_LNA3 

_mt1 

GgtAtgGaaGttGctGaamCtgGatmCgaTtaAacmCgtAtgGgtmCcaActTt 

YGR144W-50. 

_LNA3 

_mt2 

GgtAtgGaaGttGctGaamCtgGatmCcaTtaAacmCgtAtgGgtmCcaActTt 

YGR144W-50. 

_LNA3 

_mt3 

GgtAtgGaaGttGctGaamCtgGaamCcaTtaAacmCgtAtgGgtmCcaActTt 

YGR144W-50. 

_LNA3 

_mt4 

GgtAtgGaaGttGctGaamCtgGaamCctTtaAacmCgtAtgGgtmCcaActTt 

YGR144W-50. 

_LNA3 

_mt5 

GgtAtgGaaGttGctGaamCtgGaamCctAtaAacmCgtAtgGgtmCcaActTt 
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Example 2: Detection of Alternative Splice Isoforms Using Exon-Specific, Internal LNA 
Capture Probes in the Caenorhabditis elegans Gene let-2 

Capture probe design 
Finding the regions of interest 
5 From the database "intronerator" (W. Jim Kent and Al M .Zahler, "The Intronerator: 

exploring introns and alternative splicing in C. elegans," Nucleic Acids Research Jan. 1, 2000; 
28(l):91-93 and "Conservation, Regulation, Synteny, and Introns in a Large-scale C. briggsae-C. 
elegans Genomic Alignment" in Genome Research August, 2000; 10(8):1 1 15-1 125) as well as 
scientific literature, the C elegans Let-2 gene encoding type IV collagen was found according to 

10 the following criteria. The generation of mature mRNA desirably involves either complete exon 
or intron skipping. ESTs (expressed sequence tags) desirably indicate different isoforms. If 
ESTs were only different from the gene annotation(s), this could simply mean that the prediction 
is wrong, and nothing more. Desirably, there are different EST splice indications in different 
developmental stages. Two gene prediction algorithms (e.g., GeneFinder and Genie) desirably 

15 agree upon the number of genes in a coding segment. Exons of interest (e.g., exons being 
skipped and their flanking exons) in the C. elegans gene T01D3.3 desirably exceed 70 base- 
pairs. Other genes of interest may be selected using one or more of the above criteria or using 
other criteria, such as the medical relevance of the gene. 

20 Determining melting temperatures and palindromic properties of the C elegans let-2 gene/exons 

8, 9, JO, and 11 -specific capture probes 

The script PICK70 (which was kindly provided by Jingchun Zhu from the Joe DeRisi 

Laboratory and which is publicly available) was used to run a sliding 50 base-pair window 

across the regions in which an oligonucleotide capture probe should be designed. The output 
25 data were saved for later. 

Determining the uniqueness of the regions 

All regions were compared using a publicly available BLAST program to the complete 

set of annotated transcripts from the C. elegans genome downloaded from NCBI. For each 

region a list with the location of all BLAST hits was made. 
30 Choice of Desirable Tmfor Capture Probes 
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From the PICK70 output, the distribution of melting temperatures for all possible 
oligonucleotides was collected. As these centered around approximately 80°C, this temperature 
was chosen as the desirable temperature. 

For each region, an oligonucleotide with a palindromic value below 100 (default value in 
5 PICK70, value based on Smith- Waterman algorithm) and with a melting temperature closest to 
SO^C was picked. The location of the oligonucleotide within the region was then compared to 
the list made using the above BLAST search. If the oligonucleotide did not coincide with a 
BLAST hit exceeding around 25 (consecutive) base-pairs, this oligonucleotide sequence was 
chosen as a 50-mer capture probe. Otherwise, a new oligo sequence was picked from the 
10 PICK70 output. 

Checking Probe Sequences 

The selected 50-mer oligonucleotide sequences were "BLASTed" against the C. elegans 
transcripts again, as described above. 
Accounting for the Introns. 
1 5 The oligonucleotide sequences were "BLASTed" against the complete C. elegans 

genome. The matches were run against a list made from the GenBank reports of the complete 
genome, indexing whether positions in the genome were genie or intergenic. 

It was checked to determine whether new hits to genie regions appeared (compared to the 
initial BLAST search using the PICK70 output). If this was not case, the oligonucleotide 
20 sequences were selected for capture probe synthesis. 
Design of the LNA-modified Capture Probes 

For the LNA-modified oligonucleotide capture probes, every fourth DNA nucleotide was 
substituted with an LNA nucleotide, as shown in Table 3. The oligonucleotides were 
synthesized with an anthraquinone (AQ) moiety at the 5 '-end of each oligonucleotide (e.g., as 
25 described in allowed U.S. S.N. 09/61 1,833), followed by a hexaethyleneglycol tetramer linker 
region and the LNA/DNA mixmer capture oligonucleotide sequence. 
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Table 3. C. elegans let-2 gene/exons 8, 9, 10, and 1 1 -specific capture probes (SEQ ID NOs: ) 


Capture probe 

Sequence (LNA=uppercase, DNA=lowercase letters) 

CE42.08-0HEG4 

GgctGgatmCcccAggaAaccmCaggAatcGgaaGcatTggamCcaaAaggAg 

CE42.09-0HEG4 

mCaccGgatmCcggmCtcaAttgTcggAcctmCgcgGaaamCcctGgagAaaaGg 

CE42.10-0HEG4 

TccgmCcagGcccAatcGcctmCcacmCatgTccaAgggAaccAttaTcggTc 

CE42.11-0HEG4 

GagcmCaggAgagGgagGtcaAcgcGgttAcccAggaAatgGaggActcTc 


Strains and Growth Conditions 
5 C. elegans wild-type strain (Bristol-N2) was maintained on nematode growth medium 

(NG) plates seeded with Escherichia coli strain OP50 at 20°C, and the eggs and LI larvae were 
prepared as described in Hope, 1. A. (ed.) " C. elegans - A Practical Approach Oxford 
University Press 1999. The samples were immediately flash frozen in liquid N2 and stored at 
-80''''C until RNA isolation. 

1 0 Isolation of Total RNA 

A 100 |Lil aliquot of packed C elegans worms from a LI larvae population was 
homogenized using the FastPrep Biol 01 from Kem-En-Tec for 1 minute at speed 6 followed by 
isolation of total RNA from the extracts using the FastPrep Biol 01 kit (Kem-En-Tec) according 
to the manufacturer's instructions. A 50 ^il aliquot of packed C. elegans eggs was homogenized 

15 in lysis buffer (RNeasy total RNA purification kit, QIAGEN) containing quartz sand for 3 
minutes using a Pellet Pestle Motor followed by isolation of total RNA according to the 
manufacturer's manual. 

The eluted total RNA from worms (LI larvae) as well as eggs was ethanol precipitated 
for 24 hours at - 20^C by addition of 2.5 volumes of 96% EtOH and 0.1 volume of 3M Na- 

20 acetate, pH 5.2 (Ambion, USA), followed by centrifugation of the total RNA sample for 30 
minutes at 13200 rpm. The total RNA pellet was air-dried and redissolved in 6 fxl (worms) or 
2.5 |xl (eggs) of diethylpyrocarbonate (DEPC)-treated water (Ambion, USA) and stored at -80°C. 
Reverse transcription (RTJ-PCR 

Total RNA (1 .5 ^ig) fi-om eggs or 1 |Xg total RNA from worms (LI larvae) were mixed 

25 with 5 |ig oligo(dT)l 2-1 8 primer (Amersham Pharmacia Biotech, USA) and 0.5 |ig of random 
hexamers, pd(N\ (Amersham Pharmacia Biotech, USA) and DEPC-treated water to a final 


volume of 7 The mixture was heated at 70°C for 10 minutes, quenched on ice for 5 minutes, 
followed by addition of 20 units of Superasin RNase inhibitor (Ambion, USA), 4 )il of 5 x 
Superscript buffer (Life Technologies, USA), 2 |il of 100 mM DTT, 1 |il of dNTP solution (20 
mM each dATP, dGTP, dTTP and dCTP, Amersham Pharmacia Biotech, USA), and 3 |Lil of 
5 DEPC-treated water. 

The primers were pre-annealed at 3TC for 5 minutes, followed by addition of 400 units 
of Superscipt II reverse transcriptase (Invitrogen, USA). First strand cDNA synthesis was 
carried out at 37^C for 30 minutes, followed by 2 hours at 42°C, and the reaction was stopped by 
incubation at 70^C for 5 minutes, followed by incubation on ice for 5 minutes. 

10 Unincorporated dNTPs were removed by gel filtration using MicroSpin S-400 HR 

columns as described below. The column was pre-spun for 1 minute at 735 x g in a 1.5 ml tube, 
and the column was placed in a new 1 .5 ml tube. The cDNA sample was slowly applied to the 
top center of the resin and spun at 735 x g for 2 minutes. The eluate was collected. The volume 
of the eluate was adjusted to 50 |Lil with TE-buffer pH 7.0 before being used as the template for 

15 linear PGR. Four |il template (RT from eggs or worms) was combined with 1 )li1 dNTP solution 
(lOmM each dATP, dGTP, dTTP and dCTP, Amersham Phamacia Biotech, USA), 1 |il of each 
primer ( 20|iM CE42.07 sense: gatcgaattcctccaggagagaagggagatg (SEQ ID NO: ), and CE42.12 
antisense: 5'gatcaagcttatctcttcctgggtatccagctt (SEQ ID NO: )), 5 ^il 10 x AmpliTaq Gold 
Polymerase buffer, 5|Lil 25 mM MgCl2, 0.5 [i\ AmpliTaq Gold DNA polymerase (5U/|il, Applied 

20 Biosystems), 2 |li1 Cy3-dCTP (Amersham Phamacia Biotech, USA) (eggs) or 2 ^1 Cy5-dCTP 
(Amersham Pharmacia Biotech, USA) (worms), and 31.5 |il DEPC-treated water to a final 
volume of 50 |Lil. The PGR reactions were carried out using the following program: 95°G for 5 
minutes followed by 30 cycles of PGR using the following cycling program (denaturation at 
95°C for 45 seconds, annealing at 60°G for 30 seconds, and extension at 72°C for 1 minute) 

25 followed by a final extension step at 72°G for 10 minutes and incubation on ice for 5 minutes. 

Purification of the PGR amplicons from eggs as well as worms was performed using a 
Qiaquick PGR purification kit (QIAGEN) according to the manufacturer's instructions. 
Fluorochrome-labeling of the let-2 cDNA Fragments using Primer Extension 

Four (4) \i\ template (RT from eggs or worms) was combined with 1 |Lll dNTP solution 

30 (lOmM each dATP, dGTP, dTTP and dGTP, Amersham Phamacia Biotech, USA), 1 ^il of each 
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primer (20|iM CE42.12 antisense 5'gatcaagcttatctcttcctgggtatccagctt (SEQ ID NO: )), 5 ^il 10 x 
AmpliTaq Gold Polymerase buffer, 5fil 25 mM MgClj, 0.5 |il AmpliTaq Gold DNA polymerase 
(5U/|il, Applied Biosystems), 2 |li1 Cy3-dCTP (Amersham Phamacia Biotech, USA) (eggs) or 2 
^1 Cy5-dCTP (Amersham Phamacia Biotech, USA) (worms), and 31.5 ^1 DEPC-treated water to 
5 a final volume of 50 |il. The PGR reactions were carried out using the following program: 95°C 
for 5 minutes followed by 30 cycles of PGR using the following cycling program (deriaturation 
at 95 °G for 45 seconds annealing at 60°G for 30 seconds extension at 72'^G for 1 minute) 
followed by a final extension step at 72°G for 10 minutes and incubation on ice for 5 minutes. 
Purification of the PGR amplicons from eggs as well as worms were performed using a 

10 Qiaquick PGR purification kit (QIAGEN) according to the manufacturer's instructions. 

Unincorporated dNTP nucleotides were removed by gel filtration using MicroSpin S-400 HR 
columns as described above before the eluted, fluorochrome-labelled DNA fragments were 
stored at -20°G in the dark until microarray hybridization. 
Printing and Coupling of the C. elegans Let-l Exon 8-11 Microarrays 

15 The C. elegans gene Let-2/exon 8-1 1 capture probes were synthesized with a 5' 

anthraquinone (AQ)-modification, followed by a hexaethyleneglycol-4 (HEG4) linker (Table 3). 
The capture probes were first diluted to a 10 |LiM final concentration in 100 mM Na-phosphate 
buffer pH 7.0 and spotted on Euray GOP microarray slides using the Biochip Arrayer One 
(Packard Biochip Technologies) with a spot volume of 300 pi and 300 |Lim between the spots. 

20 The capture probes were immobilized onto the microarray slide by UV irradiation in a 

Stratalinker for 90 seconds at full power (Stratagene, USA). Non-immobilized capture probe 
oligonucleotides were removed from the slides by washing the slides for Vi hour in 30% acetone 
before rising in milli-Q H2O. After washing, the slides were centrifuged at 800 rpm for 2 
minutes and stored in a slide box until microarray hybridization. 

25 Comparative Hybridization of the C. elegans microarrays and Post-hybridization Washes 

The slides were hybridized with 2.5 )il of the Gy3-labelled and 2.5 jil of the Gy5-labelled 
target preparation from eggs and worms, respectively, as described above (see "Reverse 
transcription (RT)-PGR" section) in 25 |il of hybridization solution, containing 25 mM HEPES, 
pH 7.0, 3 X SSG, 0.3% SDS, and 25 |ig of yeast tRNA. The target probe was filtered in a 

30 Millipore 0.22 micron spin column (Ultrafree-MG, Millipore, USA), denatured by incubation at 
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1 OO^C for 5 minutes, cooled at room temperature for 5 minutes, and then carefully applied onto 
the prepared microarray. One-third of a cover slip was laid over the microarray, and the 
hybridization was performed for 16-18 hours at 65°C in a hybridization chamber (DieTech, 
model Joe deRisi, USA). 

5 Following hybridization, the slides were washed sequentially by plunging gently in 2 x 

SSC/0.1% SDS at room temperature until the cover slip falls off into the washing solution, then 
in Ix SSC pH 7.0 (150 mM NaCl, 15 mM Sodium Citrate) at room temperature for 1 minute, 
then in 0.2 x SSC, pH 7.0 (30 mM NaCl, 3 mM Sodium Citrate) at room temperature for 1 
minute, and finally in 0.05 x SSC (7.5 mM NaCl, 0.75 mM Sodium Citrate) for 5 seconds, 

10 followed by drying of the slides by spinning at 500 rpm for 5 minutes. The slides were stored in 
a slide box in the dark until scanning. 
Microarray data analysis 

The C elegans let-2 gene microarray was scanned in an ArrayWoRx Scanner (Applied 
Precision, USA) using an exposure time of 5 seconds, resolution of 5.0, and high (high level) 

1 5 sensitivity. The hybridization data were analyzed using the ArrayVision image analysis software 
package 5.1 (IMAGING Research Inc., USA). The detection principle for alternative exon 
skipping the C elegans let-2 gene is shown in Fig. 6. As demonstrated in Fig. 7, analysis of the 
comparative hybridization data from the C elegans Let'2 exon 8-1 1 array demonstrates detection 
of alternative exon skipping of the let-2 exon 9 (eggs) and exon 10 (LI larvae) using LNA- 

20 modified 50-mer capture probes. Capture probes of other lengths and/or with other LNA 
substitution patterns can be used similarly. 

Example 3: Improved Sensitivity in the Specific Detection of the C elegans Gene T01D3.3 
Exon 4 Using LNA-Modified Oligonucleotide Capture Probes 

25 Capture Probe Design: The design method of exon-specific capture probes for the C. elegans 
gene T01D3.3 exon 4 has been described in example 2. 

Design of the LNA-modified Capture Probes: For the LNA-spiked oligonucleotide capture 
probes, every fourth DNA nucleotide was substituted with an LNA nucleotide, as shown in Table 
30 4. 
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Table 4. C. elegans gene T01D3.3/exon 4-specific capture probes. (SEQ ID NOs: ) 


Capture probes 

Sequence (LNA=uppercase, DNA=lowercase letters) 

CEgene26.04-70 

ggctggaacagaagtttgttggtgcgtgacaaggtatggaagaagattatccggaaaagaaagcaa 
agac 

CEgene26.04-50 

ggctggaacagaagtttgttggtgcgtgacaaggtatggaagaagattat 

CEgene26.04-40 

ggctggaacagaagtttgttggtgcgtgacaaggtatgga 

CEgene26.04-30 

gaacagaagtttgttggtgcgtgacaaggt 

CEgene26.04-50HEG2 

GgctGgaamCagaAgttTgttGgtgmCgtgAcaaGgtaTggaAgaaGattAt 

CEgene26.04.50HEG4 

GgctGgaamCagaAgttTgttGgtgmCgtgAcaaGgtaTggaAgaaGattAt 

CEgene26.04-40HEG2 

GgctGgaamCagaAgttTgttGgtgmCgtgAcaaGgtaTgga 

CEgene26.04-40HEG4 

GgctGgaamCagaAgttTgttGgtgmCgtgAcaaGgtaTgga 

CEgene26.04.30HEG2 

GaacAgaaGtttGttgGtgcGtgamCaagGt 

CEgene26.04-30HEG4 

GaacAgaaGtttGttgGtgcGtgamCaagGt 


Cultivation of Caenorhabditis elegans Worms 

Mixed stage C elegans cultures were grown according to standard methods. Samples 
5 were harvested by centrifugation at 3000xg, suspended in RNA Later (Ambion, USA), and 
immediately frozen in liquid nitrogen. 
mRNA Isolation from C elegans Mixed Stages Worms 

Poly(A)^RNA was isolated from the worm samples using the Pick-Pen (Bio-Nobile, 
Finland) Starter kit combined with the KingFisher mRNA purification kit (ThermoLabsystems, 
10 Finland) according to the manufacturer's instructions. The yield was 1-2 |ig poly(A)^RNA from 
approximately 50 mg of C. elegans worms. 

Synthesis of fluorochrome labelled first strand cDNA from C elegans mRNA 

One ^ig of C elegans poly(A)^RNA was combined with 2 ^.g oligo dT primer (T20VN) 
in an RNase free, pre-siliconized 1 .5 mL tube, and the final volume was adjusted with DEPC- 

15 water to 8 [iL. The reaction mixture was heated at +70°C for 10 minutes, quenched on ice 5 
minutes, spun for 20 seconds, followed by addition of 1 |iL SUPERase-In™ (20U/|LlL, RNAse 
inhibitor, Ambion, USA), 4 5xRTase buffer (Invitrogen, USA), 2 |LlL 0.1 M DTT (Invitrogen, 
USA), 1 dNTP (20mM dATP, dGTP, dTTP; 4 mM dCTP in DEPC-water, Amersham 
Pharmacia Biotech, USA), and 3 ^iL Cy3™-dCTP (Amersham Pharmacia Biotech, USA). First 

20 strand cDNA synthesis was carried out by adding 1 |iL of Superscript™ II (Invitrogen, 200 
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U/mL), mixing, and incubating the reaction mixture for one hour at 42°C. An additional 1 |LiL of 
Superscript™ II was added and the cDNA synthesis reaction mixture was incubated for an 
additional one hour at 42°C; the reaction was stopped by heating at lO^'C for 5 minutes, and 
quenching on ice for 2 minutes. The RNA was hydrolyzed by adding 3 |iL of 0.5 M NaOH and 

5 incubating at 70°C for 15 minutes. The samples were neutralized by adding 3 fiL of 0.5 M HCl 
and purified by adding 450 \iL IxTE buffer, pH 7.5 to the neutralized sample and transferring 
the samples onto a Microcon-30 concentrator. The samples were centrifuged at 14000xg in a 
microcentrifuge for ---S minutes, the flow-through was discarded, and the washing step was 
repeated twice by refilling the filter with 450 ^il IxTE buffer and by spinning for -12 minutes. 

10 Centrifugation was continued until the volume was reduced to 5 |iL, and finally the labelled 
cDNA probe was eluted by inverting the Microcon-30 tube and spinning at lOOOx^ for 3 
minutes. 

Printing and Coupling of the C, elegans Microarrays 

The C elegans gene T01D3.3/exon 4 capture probes were synthesized with a 5' 

1 5 anthraquinone (AQ)-modification, followed by either a hexaethyleneglycol-2 or a 

hexaethyleneglycol-4 (HEG2/HEG4) linker (Table 4). The capture probes were first diluted to a 
10 |iM final concentration in 100 mM Na-phosphate buffer pH 7.0, followed by a two-fold 
dilution series (10 ^iM, 5 |iM, 2.5 |LiM, 1.25 |iM, 0.625 |LiM, 0.31 ^M, and 0.155 |LlM) and spotted 
on Exiqon's polycarbonate microarray slides using the Biochip Arrayer One (Packard Biochip 

20 Technologies, USA) with a spot volume of 3x 300 pi and 400 |Lun between the spots. The 

capture probes were immobilized onto the microarray slide by UV irradiation in a Stratalinker 
for 90 seconds at full power (Stratagene, USA). Non-immobilized capture probe 
oligonucleotides were removed from the slides by washing the slides for 24 hours in milli-Q 
H2O. After washing, the slides were dried in an oven at 2>TC for 30 minutes and stored in a slide 

25 box until microarray hybridization. 

Hybridization with CyS-labelled cDNA 

The arrays were hybridized overnight using the following protocol. The Cy3™-labelled 
cDNA sample was combined with 3 ^iL 20xSSC (3xSSC final), 0.5 |ilL 1 M HEPES, pH 7.0 (25 
mM final), 25 ^ig yeast tRNA (1 .25 ^ig/^iL final), 10 ^ig PolyA blocker (0.5 |ig/|iL final), 0.6 \xL 

30 10% SDS (0.3% final), and DEPC-treated water to 20 \ih final volume. The labelled cDNA 
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target sample was filtered in a Millipore 0.22 micron spin column according to the 
manufacturer's instructions (Millipore, USA), and the probe was denatured by incubating the 
reaction at 100°C for 2 minutes. The sample was cooled at 20-25'^C for 5 minutes by spinning at 
maxium speed in a microcentrifuge, and then carefully applied on top of the microarray. A cover 
slip was laid over the microarray and the hybridization was performed for 16 hours at 63 ®C in a 
hybridization chamber (Coming, USA) submerged in a water bath, with an aliquot of 30 |liL of 
3xSSC added to both ends of the hybridization chamber to prevent evaporation. After 
hybridization, the slide was removed carefully from the hybridization chamber and washed using 
the following protocol. The coverslip was washed off in 2 x SSC, pH 7.0 containing 0.1% SDS 
at room temperature for one minute, followed by washing of the microarrays subsequently in 1 .0 
X SSC, pH 7.0 at room temperature for one minute, and then in 0.2 x SSC, pH 7.0 at room 
temperature for one minute. Finally, the slides were washed for 5 seconds in 0.05 x SSC, pH 
7.0. The slides were then dried by centrifugation in a swinging bucket rotor at approximately 
600 rpm for 5 minutes. 
Microarray data analysis 

The C elegans gene T01D3.3 exon 4 array was scanned in an ArrayWoRx Scanner 
(Applied Precision, USA) using an exposure time of 5 seconds, resolution of 5.0, and high (high 
level) sensitivity. The hybridization data were analyzed using the ArrayVision image analysis 
software package 5.1 (IMAGING Research Inc., USA). As shown in Figs. 8A and 8B, analysis 
of the hybridization data from the C elegans gene 26/T01D3.3 exon 4 array demonstrates that 
the use of LNA-modified capture probes for the C elegans T01D3.3 exon results in 5-fold 
increased sensitivity in exon 4 capture compared to the corresponding DNA oligonucleotide 
capture probe controls printed on the same microarray. Capture probes of other lengths and/or 
with other LNA substitution patterns can be used similarly. 
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Example 4: Assessment of Capture Probe Specificity for the C. elegans Gene T01D3.3 
Exons 4 and 5 Using Synthetic Antisense Target Oligos 

Capture probe design: Ex on-specific capture probes for the C. elegans gene T01D3,3 exons 4 
5 and 5 were designed as described in Example 2. 

Design of the LNA-modified capture probes: For the LNA-spiked oligonucleotide capture 
probes, every fourth DNA nucleotide was substituted with an LNA nucleotide, as shown in Table 
5: C. elegans gene T01D3.3/exons 4 and 5-specific capture probes and synthetic target 
10 oligonucleotides. 


Tables. (SEQIDNOs:) 


Capture probes 

Sequence (LNA=uppercase, DNA=lowercase letters) 

CEgene26.04-70 

ggctggaacagaagtttgttggtgcgtgacaaggtatggaagaagattatccggaaaagaa 
agcaaagac 

CEgene26.05-70 

tatgtggcgcgaatgagcaatattcagcatgtttctcctcttgtcaaccatcatgtcaagatcctt 
caac 

CEgene26.04-50 

ggctggaacagaagtttgttggtgcgtgacaaggtatggaagaagattat 

CEgene26.05-50 

tatgtggcgcgaatgagcaatattcagcatgtttctcctcttgtcaacca 

CEgene26.04-40 

ggctggaacagaagtttgttggtgcgtgacaaggtatgga 

CEgene26.05-40 

tatgtggcgcgaatgagcaatattcagcatgtttctcctc 

CEgene26.04-30 

gaacagaagtttgttggtgcgtgacaaggt 

CEgene26.05-30 

tatgtggcgcgaatgagcaatattcagcat 

CEgene26.04-50HEG2 

GgctGgaamCagaAgttTgttGgtgmCgtgAcaaGgtaTggaAgaaGattAt 

CEgene26.04-50HEG4 

GgctGgaamCagaAgttTgttGgtgmCgtgAcaaGgtaTggaAgaaGattAt 

CEgene26.05-50HEG2 

TatgTggcGcgaAtgaGcaaTattmCagcAtgtTtctmCctcTtgtmCaacmCa 

CEgene26.05-50HEG4 

TatgTggcGcgaAtgaGcaaTattmCagcAtgtTtctmCctcTtgtmCaacmCa 

CEgene26.04.40HEG2 

GgctGgaamCagaAgttTgttGgtgmCgtgAcaaGgtaTgga 

CEgene26.04-40HEG4 

GgctGgaamCagaAgttTgttGgtgmCgtgAcaaGgtaTgga 

CEgene26.05-40HEG2 

TatgTggcGcgaAtgaGcaaTattmCagcAtgtTtctmCctc 

CEgene26.05-40HEG4 

TatgTggcGcgaAtgaGcaaTattmCagcAtgtTtctmCctc 

CEgene26.04-30HEG2 

GaacAgaaGtttGttgGtgcGtgamCaagGt 

CEgene26.04-30HEG4 

GaacAgaaGtttGttgGtgcGtgamCaagGt 

CEgene26.05-30HEG2 

TatgTggcGcgaAtgaGcaaTattmCagcAt 

CEgene26.05-30HEG4 

TatgTggcGcgaAtgaGcaaTatlmCagcAt 
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Capture probes 

Sequence (LNA=uppercase, DNA=lowercase letters) 

Target oligos 

Sequence (LNA=uppercase, DNA=lowercase letters) 

CEgene26.04-biotarget 

accttgtcacgcaccaacaaacttctgttc 

CEgene26.05-biotarget 

atgctgaatattgctcattcgcgccacata 


Printing and coupling of the C. elegans gene T01D3.3/exon 4-5 microarrays 

The C. elegans gene T01D3.3/exon 4-5 capture probes were synthesized with a 5' 
5 anthraquinone (AQ)-modification, followed by either a hexaethyleneglycol-2 or a 

hexaethyleneglycol-4 (HEG2/HEG4) linker (Table 5). The capture probes were first diluted to a 
10 ^iM final concentration in 100 mM Na-phosphate buffer pH 7.0, followed by a two-fold 
dilution series (10 |LlM, 5 |J-M, 2.5 |LlM, 1.25 ^iM, 0.625 |iM, 0.31, ^iM, and 0.155 \xM) and 
spotted on Euray polycarbonate microarray slides using the Biochip Arrayer One (Packard 

10 Biochip Technologies) with a spot volume of 3x 300 pi and 400 |im between the spots. The 
capture probes were immobilized onto the microarray slide by UV irradiation in a Stratalinker 
for 90 seconds at full power (Stratagene, USA). Non-immobilized capture probe 
oligonucleotides were removed from the slides by washing the slides for 24 hours in milli-Q 
H2O. After washing, the slides were dried in an oven at 37°C for 30 minutes, and stored in a 

1 5 slide box until microarray hybridization. 

Hybridization of the C. elegans microarrays and post-hybridization washes 

The slides were hybridized with a high (saturated) concentration of l|LiM of each gene 
T01D3.3, exon 4 or 5 target oligo (Table 5) in 50 |il of hybridization solution, containing 25 mM 
HEPES, pH 7.0, 3 x SSC, 0.22 % SDS, and 0.8 ^ig/^il of poly(A) blocker. The target probes 

20 were filtered in a Millipore 0.45 micron spin column (Ultrafree-MC, Millipore, USA), denatured 
by incubation at 100 °C for 2 minutes, cooled at room temperature for 5 minutes, and then 
carefully applied onto the prepared microarray. One-half of a cover slip was laid over the 
microarray, and the hybridization was performed for 16-18 hours at 63*^C in a hybridization 
chamber (Coming, USA). 

25 Following hybridization, the slides were washed sequentially by plunging gently in 1 x 

SSCT (150 mM NaCl, 15 mM Sodium Citrate + Tween 20) at room temperature for one minute, 
then in 0.2 x SSCT (30 mM NaCl, 3 mM Sodium Citrate + Tween 20) at room temperature for 
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one minute, and finally in Milli Q water, followed by drying of the slides in an oven at 37°C for 
30 minutes. The slides were Cy5 labelled using a Cy5-straptavidin target. Thirty jxl of a Cy5- 
streptavidin (2|Lig/ml in 1 x SSCT) were carefully applied onto the hybridized microarray and 
incubated one hour at room temperature before an additional washing step were performed in 1 x 
5 SSCT (1 50 mM NaCl, 1 5 mM Sodium Citrate + Tween 20) at room temperature for one minute, 
then in 0.2 x SSCT (30 mM NaCl, 3 mM Sodium Citrate + Tween 20) at room temperature for 
one minute, and finally in Milli Q water. Following washing, the slides were drying in an oven 
at ST'^C for 30 minutes and stored in a slide box in the dark until scanning. 
Microarray data analysis 

10 The C. elegans gene T01D3.3 exon 4-5 microarray was scanned in an Array WoRx 

Scanner (Applied Precision, USA) using an exposure time of 5 seconds, resolution of 5.0, and 
high (high level) sensitivity. The hybridization data were analyzed using the ArrayVision image 
analysis software package 5.1 (IMAGING Research Inc., USA). As shown in Figs. lOA and 
lOB, analysis of the hybridization data from the C. elegans gene 26/T01D3.3 array demonstrates 

15 that both the DNA as well as the LNA capture probes for the C elegans T01D3.3 exons 4 (Fig. 
lOA) and exon 5 (Fig. lOB), respectively are highly specific with a very low level of cross- 
hybridization between their respective target oligonucleotides. The exon-specific design of the 
oligonucleotide capture probes is thus validated. Capture probes of other lengths and/or with 
other LNA substitution patterns can be used similarly. 

20 

Example 5: Detection of Alternatively Spliced Isoforms using Internal Exon-specific, and 
Exon-Exon Junction-Specific (merged) LNA-modified Capture Probes 

Oligonucleotide design for microarray s: 

Methods for designing exon-specific internal oligonucleotide capture probes has been 
25 described in Example 2. 

Design of the LNA-modified capture probes 

For the LNA-modified oligonucleotide capture probes, every third DNA nucleotide was 
substituted with an LNA nucleotide. The probes designed to capture the junction of the 
recombinant splice variants were designed with LNA modifications in a block of five 
30 consecutive LNAs nucleotides, two on the 5' side of the splice junction and three on the 3' side 
of the splice junction. All capture probes are shown in Table 6. 
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Table 6. Internal, exon-specific and merged, exon-exon junction specific oligonucleotide capture 

probes. (SEQ ID NOs: ) 

Sequence (LNA=uppercase5 DNA lowercase letters) 

cctgaaagtagatttgttatttccgaaacgccttctcccgttcttaagtc 
catataccacaaatagtccctcaaaaatcacaagaaaactcacaacactg 
gatttgcagcggtggtaaaaagtatgaaaacgtggtaattaaaaggtctc 
ccaatgaaaactaatcaaaggtaaacgtggatcccatggcaattcccggg 
caacactgcccagaggttcaatcgatccgatgatcctaatgaaggcgccc 
gtccagtatcgtccatcatagtatcgataaatatgtgaaggaaatgcctg 
caacactgcccagaggttcaatcgatgtgtgataggatcagtgttcaggg 
gaaggcgaaggagactgctaatatcgataaatatgtgaaggaaatgcctg 
inCctGaaAgtAgaTttGttAttTccGaaAcgmCctTctniCccGttmCttAagTc 
mCatAtamCcamCaaAtaGtcmCctmCaaAaaTcamCaaGaaAacTcamCaamCacTg 
GatTtgmCagmCggTggTaaAaaGtaTgaAaamCgtGgtAatTaaAagGtcTc 
mCcaAtgAaaActAatmCaaAggTaaAcgTggAtcmCcaTggmCaaTtcmCcgGg 
caacactgcccagaggttcaatcGATmCmCgatgatcctaatgaaggcgccc 
gtccagtatcgtccatcatAGTATcgataaatatgtgaaggaaatgcctg 
caacactgcccagaggttcaatcGATGTgtgataggatcagtgttcaggg 
gaaggcgaaggagactgctAATATcgataaatatgtgaaggaaatgcctg 


Capture probes 

gene78.01a 

gene78.01b 

gene78.03a 

gene78.03b 

gene78.m01INS3 

gene78.mINS303 

gene78.m01INS4 

gene78.mINS403 

gene78.01a_50_LNA3 

gene78.01b_50_LNA3 

gene78.03a_50_LNA3 

gene78.03b_50_LNA3 

gene78.ni01 INS3_50_block 

gene78.mINS303_50_block 

gene78.Tn01 INS4_50_block 

gene78.mINS403_50_block 


5 Printing and Coupling of the Splice Isoform-Specific Microarrays 

The splice variant capture probes were synthesized with a 5' anthraquinone (AQ)- 
modification, followed by a hexaethyleneglycoK2 (HEG2) linker. The capture probes were first 
diluted to a 20 |LiM final concentration in 100 mM Na-phosphate buffer pH 7.0, and spotted on 
the Immobilizer polymer microarray slides (Exiqon, Denmark) using the Biochip Arrayer One 

10 (Packard Biochip Technologies, USA) with a spot volume of 2x 300 pi and 300 |iim between the 
spots. The capture probes were immobilized onto the microarray slide by UV irradiation in a 
Stratalinker with 2300 jijoules (Stratagene, USA). Non-inmiobilized capture probe 
oligonucleotides were removed fi-om the slides by washing the slides two times 15 minutes in 
IxSSC. After washing, the slides were dried by centrifiigation at lOOOx g for 2 minutes, and 

1 5 stored in a slide box until microarray hybridization. 
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Construction of Splice Variant Vectors 

The recombinant splice variant constructs were cloned into the TriamplS vector 
(Ambion, USA). The constructs were sequenced to confirm their construction. The plasmid 
clones were transformed into E. co/z XLIO-Gold (Stratagene, USA). 
5 Triampl8/SWI5 vector construct 

Genomic DNA was prepared from a wild-type standard laboratory strain of 
Saccharomyces cerevisiae using the Nucleon MiY DNA extraction kit (Amersham Biosciences, 
USA) according to the supplier's instructions. Amplification of the partial yeast gene was 
performed using standard PGR with yeast genomic DNA as the template. In the first step of 

10 amplification, a forward primer containing a restriction enzyme site and a reverse primer 

containing a universal linker sequence were used. In this step, 20 base-pairs were added to the 
3 '-end of the amplicon, next to the stop codon. In the second step of amplification, the reverse 
primer was exchanged with a nested primer containing a poly-T2o tail and a restriction enzyme 
site. The SWI5 amplicon contains 730 bp of the SWI5 ORF plus a 20 bp universal linker 

1 5 sequence and a poly-A2o tail. The PGR primers used were YDR146G-For-EcoRI 
(acgtgaattcaaatacagacaatgaaggagatga) (SEQ ID NO: ), 

YDR146C-Rev-Uni (gatccccgggaattgccatgttacctttgattagttttcattggc (SEQ ID NO: )), and 
Uni-polyT-BamHI (acgtggatccttttttttttttttttttttgatccccgggaattgccatg (SEQ ID NO: )). 

The PGR amplicon was cleaved with the restriction enzymes ^'coRI and BdmHl, The 
20 DNA fragment was ligated into the pTRIampl 8 vector (Ambion, USA) using the Quick Ligation 
Kit (New England Biolabs, USA) according to the supplier's instructions and transformed into E, 
coli DH-5a by standard methods. 

Construction of the Recombinant Splice Variant #7 (T riamplS/swiS-rubisco) 

Arabidopsis thaliana Rubisco small subunit ssu2b gene fi-agment (gi 17064721) was 

25 amplified from genomic DNA using primers named DJ305 (5'- 

AGTATGATGGAGGATAGTGGAG-3' (SEQ ID NO: )) and DJ306 (5'- 
ATTGGATGGATGGGATGATGGTAATGAAGGG-3' (SEQ ID NO: )), containing CM 
restriction site linkers. The purified PGR fi-agment was digested with Glal and then cloned into 
the swi5 (gl:7839148) vector at the unique Clal site (atcgat) giving each insert a flanking 

30 sequence fi-om the original yeast SWI5 insert (named exonOl and exon 03, Fig. 1 1). The product 
was inserted in the reverse orientation, so that the insert sequence is as follows: 
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AtcgatCCGATGATCCTAATGAAGGCGCCCGGGTACTCCTTCTTGCATTCTTCAACTTCC 
TTCAACACTTGAGCGGAGTCGGTGCATCCGAACAATGGAAGCTTCCACATTGTCCAG 
TATCGTCCATCATAGTatcgat (SEQ ID NO: ). 

Nucleotide sequence analysis revealed a difference between the sequence of A. thaliana 
5 rubisco expected from the GenBank database and that obtained from all sequenced constructs 
and PGR products. Position 30 in the Rubisco insert is "C" rather than the expected "A." This 
SNP was probably created by PGR. None of the oligonucleotide capture probes used in the 
example cover this region. The Rubisco sequence in Genbank is TCCTAATGAAGGCGCCA 
(SEQ ID NO: ), and the sequence obtained from the plasmid contruct is 
10 TCCTAATGAAGGCGCCC (SEQ ID NO: ). 

Construction of the Recombinant Splice Variant # 2 (T riampJ S/swiS-lea) 

ThG Arabidopsis thaliana Lea gene (gi 1526423) was amplified from genomic DNA with 
primers named DJ307 (5'-GGAATTATCGATGTGTGATAGGATCAGTGTTCAG-3' (SEQ ID 
NO: )), and DJ308 (5'-AATTGGATCGATATTAGCAGTCTCCTTCGCC-3' (SEQ ID NO: )), 
15 including the Clal linker sites as above. The PGR fragment was digested with Clal cloned into 
the yeast SWI5 IVT construct as above at the unique Clal site. 

The fragment was inserted in the forward orientation, resulting in the following insert 
sequence: 

atcgatGTGTGATAGGTTCAGTGTTCAGGGCTGTCCAAGGAACGTATGAGCATGCGAGA 
20 GACGCTGTAGTTGGAAAAACCCACGAAGCGGCTGAGTCTACCAAAGAAGGAGCTCA 
GATAGCTTCAGAGAAAGCGGTTGGAGCAAAGGACGCAACCGTCGAGAAAGCTAAG 
GAAACCGCTGATTATACTGCGGAGAAGGTGGGTGAGTATAAAGACTATACGGTTGA 
TAAAGCTAAAGAGGCTAAGGACACAACTGCAGAGAAGGCGAAGGAGACTGCTAATa 
tcgat. (SEQ ID NO: ) 
25 In vitro RNA Preparation from Splice Variant Vectors 

In vitro RNA from the splice variants were made using the MEGAscript^^ high yield 
transcription kit according to the manufacturer's instructions (Ambion, USA). The yield of IVT 
RNA was quantified at a Nanodrop spectrophotometer (Nanodrop Technologies, USA, Fig. II). 
Isolation of total RNA from C.elegans 
30 C elegans wild-type strain (Bristol-N2) was maintained on nematode growth medium 

(NG) plates seeded with Escherichia coli strain OP50 at 20^C, and the mixed stages of the 
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nematode were prepared as described by Hope (ed.) ("C elegans - A Practical Approach", 

Oxford University Press 1999). The samples were immediately flash frozen in liquid N2 and 

stored at -SO^'C until RNA isolation. 

A 100 |il aliquot of packed C elegans worms from a mixed stage population was 
5 homogenized using the FastPrep Biol 01 from Kem-En-Tec for one minute, speed 6 followed by 

isolation of total RNA from the extracts using the FastPrep Biol 01 kit (Kem-En-Tec) according 

to the manufacturer's instructions. 

The eluted total RNA was ethanol precipitated for 24 hours at - 20^C by addition of 2.5 

volumes of 96% EtOH and 0.1 volume of 3M Na-acetate, pH 5.2 (Ambion, USA), followed by 
10 centriftigation of the total RNA sample for 30 minutes at 13200 rpm. The total RNA pellet was 

air-dried and redissolved in 10 |Lll of diethylpyrocarbonate (DEPC)-treated water (Ambion, USA) 

and stored at -SO^'C. 

Fluorochrome-labelling of the Target 

Ten (10) |ig total RNA from C elegans and 1 ng of in vitro RNA from Splice variant #1 
15 were combined with 5 |ig anchored oligo(dT2o) primer and DEPC-treated water to a final volume 

of 8 |Xl. The mixture was heated at 70^C for 10 minutes, quenched on ice for 5 minutes, 

followed by addition of 20 units of Superasin RNase inhibitor (Ambion, USA), 1 |iil dNTP 

solution (lOmM each dATP, dGTP, dTTP and 0.4 mM dCTP, and 3 ^il Cy5-dCTP, Amersham 

Biosciensces, USA), 4 |xl 5 x RTase buffer (Invitrogen), 2^1 0.1 mM DTT (Invitrogen), 400 units 
20 of Superscript II reverse transcriptase (Invitrogen, USA), and DEPC-treated water to 20 ^il final 

volume. 

A parallel set-up was made with 10 ^g total RNA from Celegans and 1 ng of in vitro 
RNA from Splice variant #2, labelling with Cy3-dCTP. Both cDNA syntheses were carried out 
at 42°C for 2 hours, and the reactions were stopped by incubation at 70^C for 5 minutes, 

25 followed by incubation on ice for 5 minutes. 

Unincorporated dNTPs were removed by gel filtration using MicroSpin S-400 HR 
colunms as described below. The column was pre-spun for one minute at 1 500 x g in a 1 .5 ml 
tube, and the column was placed in a new 1 .5 ml tube. The cDNA sample was slowly applied to 
the top center of the resin and spun 1500-x g for 2 minutes. The eluate was collected. RNA was 

30 degraded by adding 3 |al of 0.5 M NaOH. The solution was mixed well and incubated at 70 ""C 
for 15 minutes. The solution was neutralized by adding 3 ^1 of 0.5 M HCl and mixed well. 
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Then, 450 ^il IxTE, pH 7.5 was added to the neutralized sample, and the sample was transferred 
onto a Microcon-30 concentrator (prior to use, 500 ^1 IxTE was spun through the column to 
remove residual glycerol). The samples were spun at 14000 x g in a micro centrifuge for 12 
minutes, and the volume was checked. Spinning was continued until the volume was reduced to 
5 5 |iil. The labelled cDNA probe was eluted by inverting the Microcon-30 tube and spinning at 
1000 X g for 3 minutes. The Microcon filter was checked for proper elution. 
Comparative Hybridization of the Splice Variant Microarrays and Post-hybridization washes 

The Cy3 and Cy5 -labelled cDNA samples, respectively, were combined in one tube. The 
following was added: 3.75 ^1 20x SSC (3x SSC final, pass through 0.22 tifilter prior to use to 

10 remove particulates), yeast tRNA (1 ^g/^il final), 0.625 ^1 1 M HEPES, pH 7.0 (25 mM final, 
pass through 0.22 iifilter prior to use to remove particulates), 0.75 ^1 10 % SDS (0.3 % final), 
and DEPC-water to 25 ^l final volume. The labelled cDNA target sample was filtered in 
Millipore 0.22 filter spin column (Ultrafi-ee-MC, Millipore, USA) according to the 
manufacturer's instructions, followed by incubation of the reaction mixture at 100 °C for 2-5 

15 minutes. The cDNA probes were cooled at room temperature for 2-5 minutes by spinning at 

maxium speed in a microcentrifuge. A LifterSlip (Erie Scientific Company, USA) was carefully 
placed on top of the microarray spotted on Immobilizer™ MicroArray Slide, and the 
hybridization mixture was applied to the array from the side. An aliquot of 30 |ilL of 3xSSC was 
added to both ends of the hybridization chamber, and the Immobilizer™ MicroArray Slide was 

20 placed in the hybridization chamber (DieTech, USA). The chamber was sealed watertight and 
incubated at 65°C for 16-18 hours submerged in a water bath. After hybridization, the slide was 
removed carefully from the hybridization chamber and washed using the following protocol. 

The slides were washed sequentially by plunging gently in 2 x SSC/0. 1% SDS at room 
temperature until the cover slip falls of into the washing solution, then in Ix SSC pH 7.0 (1 50 

25 mM NaCl, 15 mM Sodium Citrate) at room temperature for one minute, then in 0.2 x SSC, pH 
7.0 (30 mM NaCl, 3 mM Sodium Citrate) at room temperature for one minute, and finally in 
0.05 X SSC (7.5 mM NaCl, 0.75 mM Sodium Citrate) for 5 seconds, followed by drying of the 
slides by spinning at 1000 x g for 2 minutes. The slides were stored in a slide box in the dark 
until scanning. 

30 Microarray data analysis 
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The splice variant microarray was scanned in a ScanArray 4000XL confocal laser 
scanner (Packard Instruments, USA). The hybridization data were analyzed using the GenePix 
Pro 4.01 microarray analysis software (Axon, USA). 

In the data analysis, the experimental variation in the labelling efficiency between the two 
5 fluorescent dyes was normalized (scaled) as follows. The average signal intensities from the 
"exonl" and "exon3" internal capture probes (Table 6), were used to calculate normalization 
factor of 2.75. This factor was multiplied to the signal intensity values from the Cy-3 target. 

Analysis of the data from the specific detection of the two recombinant splice variants in 
a complex RNA pool demonstrates that the merged capture probes containing a LNA block have 
10 significantly higher signals and a very low level of cross-hybridization, compared to the DNA 

capture probes (Figs. 12A and 12B). In addition, the specific detection of the two artificial splice 
variants #1 and #2 is validated with the results from LNA-modified oligonucleotide capture 
probes. Capture probes of other lengths and/or with other LNA substitution patterns can be used 
similarly. In contrast, the corresponding DNA oligonucleotide capture probes fail to detect 
1 5 splice variant #1 (Fig. 1 2B), 

Example 6: The Use of LNA-modified Oligonucleotides in Microarrays Provides 
Significantly Improved Sensitivity in Expression Profiling 

This example demonstrates the advantages of using LNA oligonucleotide microarrays in 

20 gene expression profiling experiments. Capture probes for the Saccharomyces cerevisiae gene 
SWI5 (YDR146C) were designed as 50-mer standard DNA and two different LNA-modified 
oligonucleotides with LNA substitutions at every second or every third nucleotide position, 
respectively, for comparison (Table 7). To assess the sensitivity of DNA versus LNA capture 
probes, hybridizations with different amounts of biotin-labelled antisense oligonucleotides in a 

25 1 0-fold dilution series were performed. 

Design and Synthesis of the LNA Capture Probes 

To design capture probes, regions with unique mRNA sequence of the selected target 
genes were identified. Optimized 50-mer oligonucleotide sequences with respect to T^, self- 
complementarity, and secondary structure were selected. LNA modifications were incorporated 

30 to increase affinity and specificity. The biotin-labelled antisense DNA target oligonucleotide 
corresponds to the reverse complement sequence. 
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Printing of the LNA Microarrays 

The microarrays were printed on Immobilizer™ MicroArray Slides (Exiqon, Denmark) 
using the Biochip One Arrayer from Packard Biochip technologies (Packard, USA). The arrays 
were printed with a spot volume of 2x300 pi of a 10 ixM (final concentration) capture probe 
5 dilution. Four replicas of the capture probes were printed on each slide 
Hybridization with Biotin-labelled Antisense Oligonucleotide 

The arrays were hybridized overnight using the following protocol. The desired amount 
of biotin-labelled oligonucleotide was combined in one tube followed by addition of 3 ^iL 
20xSSC (3xSSC final), 0.5 |LiL 1 M HEPES, pH 7.0 (25 mM final), 25 |ig yeast tRNA (1.25 

10 |ig/|j,L final), 0.6 [iL 10% SDS (0.3% final), and DEPC-treated water to 20 ^iL final volume. 
The biotin-labelled target sample was filtered in a Millipore 0.22 micron spin column according 
to the manufacturer's instructions (Millipore, USA), and the probe was denatured by incubating 
the reaction at 100°C for 2 minutes. The sample was cooled at 20-25°C for 5 minutes by 
spinning at maxium speed in a microcentrifuge. A LifterSlip (Erie Scientific Company, USA) 

15 was carefully placed on top of the microarray spotted on Immobilizer™ MicroArray Slide, and 
the hybridization mixture was applied to the array from the side. An aliquot of 30 \xL of 3xSSC 
was added to both ends of the hybridization chamber, and the Immobilizer™ MicroArray Slide 
was placed in the hybridization chamber. The chamber was sealed watertight and incubated at 
65°C for 16-18 hours submerged in a water bath. After hybridization, the slide was removed 

20 carefully from the hybridization chamber and washed using the following protocol. The 

Lifterslip coverslip was washed off in 2xSSC, pH 7.0 containing 0.1% SDS at room temperature 
for 1 minute, followed by washing of the microarrays subsequently in l.OxSSC, pH 7.0 at room 
temperature for 1 minute, and then in 0.2xSSC, pH 7.0 at room temperature for 1 minute. 
Finally, the slides were washed for 5 seconds in 0.05xSSC, pH 7.0. The slides were then dried 

25 by centriftigation in a swinging bucket rotor at approximately 200 G for 2 minutes. To visualize 
the biotin containing duplexes, an aliquot of 40 |iL of the 2 ^ig/ml streptavidin-Cy3 in 
lxSSC+0.05 % Tween solution was applied to the slide as described for the hybridization 
mixture above. The slide was incubated in a humidified chamber for 1 hour at room 
temperature. The coverslip was washed off in lxSSC+0.05 % Tween for 1 minute, followed by 
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wash in 0.2xSSC+0.05 % Tween for 1 minute and then 10 seconds in MilliQ-water. The slide 
was dried by centrifugation in a swinging bucket rotor for 2 minutes at 200 G. 
Data Analysis 

Following washing and drying, the slides were scanned using a ScanArray 4000XL 
5 scanner (Perkin-Elmer Life Sciences, USA), and the array data were processed using the 
GenePix™ Pro 4.0 software package (Axon, USA). 
Results 

Incorporation of LNA nucleotides at every second or third nucleotide position in standard 
50-mer expression array oligonucleotide capture probes results in a 2-7-fold increase in 
10 fluorescence intensity levels using an unsaturated target concentration and hybridizing under 
standard stringency conditions (Fig. 13). Thus, it can clearly be concluded that the LNA 
oligonucleotides are more sensitive in expression profiling compared to DNA oligonucleotides. 

Table 7. DNA and LNA-modified SWI5 (YDR146C) oligonucleotide capture probes. LNA 
15 modifications are depicted by uppercase letters in the sequence; "mC" denotes LNA methyl 
cytosine. (SEQ ID NO: ) 
Oligo Name Sequence 

YDR 1 46C-50 tgggaatggaacggggattatggtttcgccaatgaaaactaatcaaaggt 

YDR 1 46C-50_LN A2 TgGg AaTgGaAcGgGgAtTaTgGtTtmCgmCc AaTgAa AamCtAaTcAa AgGt 

YDR 1 46C-50_LN A3 TggGaaTggAacGggGatTatGgtTtcGcc AatGaa AacTaaTcaAagGt 

Example 7: The Use of LNA-modified Oligonucleotides in Microarrays Provides 
Significantly Improved Sensitivity in Comparative Genome Hybridization (CGH). 

20 This example demonstrates the advantages of using LNA oligonucleotide microarrays in 

Comparative Genome Hybridization (CGH) experiments. Capture probes for all 23 exons of the 
Menkes gene (ATP7A) v^ere designed as 50-mer standard DNA and different LNA/DNA mixmer 
oligonucleotides, respectively, for comparison (Fig. 17). The C6-amino-linked capture probes 
were appHed to ImmobiHzer slides and hybridized with patient DNA samples labelled with a 

25 Cy3 fluorescent dye. 


100 


Design and synthesis of the LNA capture probes 

To design the capture probes, regions comprising individual exons of the Menkes gene 
were identified. The optimal 50-mer oligonucleotide sequences with respect to T^, self- 
complementarity, and secondary structure were selected for each exon. LNA modifications were 
5 incorporated to increase affinity and specificity. A software tool "OligoDesign", which 
automatically designs capture probes that are optimized for sequence specificity, Tm, self- 
complementarity, secondary structure, and LNA modifications was used for oligonucleotide 
design. 
Results 

10 Fluorescent Cy3 labelled patient genomic DNA was hybridized to microarrays spotted 

with the CGH capture probes listed in Fig. 17. Compared to DNA capture probes, capture 
probes with LNA in every second position (LNA-2) had a significantly better capture rate of 
non-amplified labelled genomic patient DNA as shown in Figs. 14-16. Capture probes of other 
lengths and/or with other LNA substitution patterns can be used similarly. 

15 

Example 8: Expression Profiling of Stress and Toxicity in Caenorhabditis elegans using 
LNA Oligonucleotide Microarrays 

This example demonstrates the use of the C. elegans LNA tox oligoarray in gene 
expression profiling experiments in the nematode Caenorhabditis elegans. The C elegans tox 

20 oligoarray monitors the expression of a selection of 1 10 genes relevant for general stress 

response and for the metabolism of toxic compounds. Two different capture probes for each of 
these target genes were designed and included in the LNA tox array. In addition, the C elegans 
LNA tox oligoarray contained capture probes providing control for cDNA synthesis efficiency 
and the developmental stage of the nematode. Capture probes for constitutively expressed genes 

25 for data set normalization were also included on the C. elegans LNA tox oligoarray. 

Cultivation of C elegans Worms 

For all cultures, the sample was divided into two, and one half of the sample was used as 
the control, the other was used as the treated sample. Worm samples were harvested and sucrose 
cleaned by standard methods. For heat shock treatment, the heat shock sample was added to S- 
30 media preheated to 33°C in a 1 L flask suspended in a water bath at 33°C, the other sample was 
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added to a 1 L flask with S-media at 25^C. Both samples were shaken at approximately 100 rpm 
for an hour. For Lansoprazole treatment, 0.5 mL of 1 0 mg/mL Lansoprazole (Sigma) in DMSO 
was added to each 500 mL volume of S-media culture after 28 hours of growth from LI . At the 
same time, 0.5 mL of DMSO was added to the control. Incubation was for 24 hours. Samples 
5 were then harvested by centrifiigation at 3000xg suspended in RNALater™ (Ambion) and 
immediately frozen in liquid nitrogen. 

RNA Extraction 

RNA was extracted from the worm samples using the FastRNA® Kit, GREEN (Q-BIO) 
10 essentially according to the suppliers' instructions. 

Design and Synthesis of the LNA Capture Probes 

To design the capture probes, regions with unique mRNA sequence of the selected target 
genes were identified. The optimal 50-mer oligonucleotide sequences with respect to Tm, self- 
complementarity, and secondary structure were selected. LNA modifications were incorporated 

15 to increase affinity and specificity. 
Printing of the LNA Microarrays 

The microarrays were printed on Immobilizer™ MicroArray Slides (Exiqon, Denmark) 
using the Biochip One Arrayer from Packard Biochip technologies (Packard, USA). The arrays 
were printed with a spot volume of 2x300 pi of a 1 0 jiM capture probe solution. Four replicas of 

20 the capture probes were printed on each slide. 

Synthesis of Fluorochrome Labelled First Strand cDNA from Total RNA 

15 |ig of C elegans total RNA was combined with 5 \ig oligo dT primer (T20VN) in an 
RNase free, pre-siliconized 1 .5 mL tube, and the final volume was adjusted with DEPC-water to 
8 |J-L. The reaction mixture was heated at +70°C for 10 minutes, quenched on ice 5 minutes, 

25 spin 20 seconds, followed by addition of 1 |ilL SUPERase-In™ (20U/^L, Ambion, USA), 4 ^iL 
5xRTase buffer (Invitrogen, USA), 2 p.L 0.1 M DTT (Invitrogen, USA), 1 ^iL dNTP (20mM 
dATP, dGTP, dTTP; 0.4 mM dCTP in DEPC-water, Ajnersham Pharmacia Biotech, USA), and 3 
\xL Cy3™-dCTP or Cy5™-dCTP (Amersham Pharmacia Biotech, USA). First strand cDNA 
synthesis was carried out by adding 1 [iL of Superscript™ II (Invitrogen, 200 U/mL), mixing, 

30 and incubating the reaction mixture for 1 hour at 42°C. An additional 1 jlL of Superscript™ II 
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was added, and the cDNA synthesis reaction mixture was incubated for an additional 1 hour at 
42^C; the reaction was stopped by heating at 70°C for 5 minutes, and quenching on ice for 2 
minutes. The RNA was hydrolyzed by adding 3 |iL of 0.5 M NaOH, and incubating at 70°C for 
15 minutes. The samples were neutralized by adding 3 |iL of 0.5 M HCl, and purified by adding 
5 450 \iL IxTE buffer, pH 7.5 to the neutralized sample and transferring the samples onto a 
Microcon-30 concentrator. The samples were centrifuged at 14000xg in a microcentrifuge for 
-8 minutes, the flow-through was discarded, and the washing step was repeated twice by 
refilling the filter with 450 ^1 IxTE buffer and by spinning for -12 minutes. Centrifiigation was 
continued until the volume was reduced to 5 |LiL, and finally the labelled cDNA probe was eluted 

10 by inverting the Microcon-30 tube and spinning at lOOOxg for 3 minutes. 
Hybridization with Fluorochrome-labelled cDNA 

The arrays were hybridized overnight using the following protocol. The Cy3™ and 
Cy5™-labelled cDNA samples were combined in one tube followed by addition of 3 |xL 20xSSC 
(3xSSC final), 0.5 |iiL 1 M HEPES, pH 7.0 (25 mM final), 25 ^g yeast tRNA (1.25 ^g/^L final), 

15 0.6 |lL 10% SDS (0.3% final), and DEPC-treated water to 20 \ih final volume. The labelled 
cDNA target sample was filtered in a Millipore 0.22 micron spin column according to the 
manufacturer's instructions (Millipore, USA), and the probe was denatured by incubating the 
reaction at 100°C for 2 minutes. The sample was cooled at 20-25*^C for 5 minutes by spinning at 
maximum speed in a microcentrifuge. A LifterSlip (Erie Scientific Company, USA) was 

20 carefully placed on top of the microarray spotted on Immobilizer™ MicroArray Slide, and the 
hybridization mixture was applied to the array from the side. An aliquot of 30 |LiL of 3xSSC was 
added to both ends of the hybridization chamber, and the Immobilizer™ MicroArray Slide was 
placed in the hybridization chamber. The chamber was sealed watertight and incubated at 65°C 
for 16-18 hours submerged in a water bath. After hybridisation, the slide was removed carefully 

25 from the hybridization chamber and washed using the following protocol. The Lifterslip 
coverslip was washed off in 2xSSC, pH 7.0 containing 0.1% SDS at room temperature for 1 
minute, followed by washing of the microarrays subsequently in 1 .OxSSC, pH 7.0 at room 
temperature for 1 minute, and then in 0.2xSSC, pH 7.0 at room temperature for 1 minute. 
Finally, the slides were washed for 5 seconds in 0.05xSSC, pH 7.0. The slides were then dried 
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by centriftigation in a swinging bucket rotor at approximately 200 G for 2 minutes. The slide 
was then ready for scanning. 


Data analysis, 

5 Following washing and drying, the slides were scanned using a ScanArray 4000XL 

scanner (Perkin-Elmer Life Sciences, USA), and the array data were processed using the 
GenePix™ Pro 4.0 software package (Axon, USA). The data in each image was normalized so 
that the ratio of means of all of the features is equal to 1 . 


10 Results 

Use of LNA-modified oligonucleotide capture probes in the C. elegans LNA tox 
oligoarray clearly allows the identification of distinct expression profiles for C elegans genes 
relevant for general stress response and for the metabolism of toxic compounds. 


15 Table 12. Expression profiling using LNA Oligonucleotide Microarrays. Log2 transformed fold 
of changes for five selected genes in the two expression profiling experiments. 
Protein name 

(clone name) Heat shock Lansoprazole 

HSP70 (F44E5.4/5) 4.11 nd 

CYP37A(F01D5.9) nd 0.98 
Ubiquitin(M7.1) 0.16 -0.12 

Histone IQ (C01B10.5) -1.49 nd 
HSP90 (C47E8.5) nd -1.17 

Table 13. LNA-modified oligonucleotide capture probes. LNA modifications are depicted by 
uppercase letters in the sequence; "mC" denotes LNA methyl cytosine. (SEQ ID NOs: ) 
Oligo Name Sequence 

CEABC_C34G6.4_u293_LNA3 TgcmCatTgcAcgGgcActTgtTcgAtcTccTtcTgtTttActTttGgaTg 
CEABC_C34G6.4_u375_LNA3 TcaTtcTagGatTgcmCagAtgGttAtgAtamClcAtgTcgGagAgaAagGa 
CEABC_F57C1 2.4_u1 5_LNA3 mCcaAtgTtgTttAatTggTtgTaaTgtmCttGatGacmCtgmCatAatmCatAt 
CEABC_F57C12.4_u480_LNA3 mCacAagAtcmCtgTgtTgtTctmCcgGaamCaaTgaAaaTgaActTagAtcmCa 
CEABC_F57C1 2.5_u1 1 1_LNA3 TacTtgTtcTcgAcaAagGttGtgTagmCcgAgtTtgAcamCtcmCgaAgaAa 
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Oligo Name Sequence 

CEABC_F57C12.5_u444_LNA3 TgaActTggAtcmCctTctTtgmCatTtaGcgAtgAtcAaaTttGggAagmCg 

CEABC_K08E7.9_d8_LNA3 TcaTtaAttTtgTgtAgcTttmCttTctmCgaTttTtgmCacGatmCttTccmCc 

CEABC_K08E7.9_u51_LNA3 AggGtgmCctActAcaAacTgamCccAaaAgcAgaTgamCcgAgaAgaAatAa 

CEABC_Y39D8C.1_u37_LNA3 AttGaaAgcGacGcgGaaAgtGccAtgTatTtcTaaTttTgtTttmCttTa 

CEABC_Y39D8C.1_u422_LNA3 TtgTcaGcaTatmCaaGagTagAtaTggAagTggAtamCacTctGctAatmCc 

CEADH_H24K24.3a_d3_LNA3 mCacmCttAttGcgTtcAatTttTgtTtcmCacmCtamCtamCtamCgaAtamCgtTg 

CEADH_H24K24.3a_u50_LNA3 TcamCaaGggAgaGagTctGcgGtcGgtGctGgcGttmCgaGaaAatAtaAc 

CEAPEX_R09B3.1_u191_LNA3 mCatGcaTccmCgamCgaGaaGaaGtamCtcAttTtgGagTtaTctGgcGaaTt 

CEAPEX_R09B3.1_u37_LNA3 GacmCatGctmCcgGtcGtcAtgmCaaAtcGacTtcTaaAttGctTctGatTa 

CEAPO_C35D10.9_u15_LNA3 TtgmCatGctGttAaaAccTatmCgtGtamCaaTatTgcmCtgTatAttmCccmCt 

CEAPO_C35D10.9_u609_LNA3 TggmCacAgcTtaAtaAcaAatTggAaaGtcGagGatTagTcgGtgTtgAa 

CEAPO_C48D1 .2_u1 76_LNA3 GacAcamCgcAaaGgaTatGgaTgtTgtTgaGctGctGacTgaAgtmCaaTa 

CEAPO.C48D1.2_u23_LNA3 AgcAcgAaamCtcTgcmCgtmCtaAaaTtcActmCgtGatTcaTtgmCccAatTg 

CEAPO_F20C5.1_u453_LNA3 AtgGtcAtamCtcTaaAatGggmCagAacTtcAacmCaaAtcAttmCtcGtcAg 

CEAPO_F20C5.1_u96_LNA3 AacmCcgAgcTtgmCcgmCaaAgtGcaAgaAaaTtaTagAacGaaTgaAacAg 

CEATPase_B0365.3_u31_LNA3 GgaTggGtcGagmCgtGagAccTacTacTaaAgaAcaGctTgtGaaTctTt 

mCaamCgtTctmCgaTtcmCtarnCggAcaAgaAtgGacmCtaTgcmCaamCagAaa 

CEATPase_B0365.3_u386_LNA3 Ga 

CEATPase_C1 7H1 2.14_u356_LNA3 TgcTcgTtaTccAgcTatTttGaaGggActTgtmCatGcaAggActTctTc 

CEATPase_C1 7H1 2.14_u89_LNA3 iTiCcgTttAgaGctTatTgcTaamCcaGatTgtmCccAcaAgtmCagAacAgcTc 

CEATPase_F55F3.3_u215_LNA3 TgamCggAcgmCtamCtamCccAtaTgtAttTgtTccAtcTtamCcaGcaAccAa 

CEATPase_F55F3.3_u275_LNA3 AgcTacTtcAttmCgamCaaGgaAcaTctmCggAaaAgtmCaaGtamCatmCccGg 

CEATPase_Y49A3A.2_u103_LNA3 AaaTtcAagGatmCcaGttGccGatGgtGaaGccAagAttmCgcAagGatTa 

CEATPase_Y49A3A.2_u272_LNA3 mCgaTcgTttmCtgmCccAttmCtamCaaGacTgtmCggTatGctmCaaGaaTatGa 

CECALR_Y38A10A.5_u238_LNA3 TcaGgaAcgAtcTttGacAacAttAtcAtcAccGacTctGttGagGagGc 

CECALR_Y38A10A.5_u296_LNA3 TgaActmCtamCtcTtaTgaAagmCtgGggAgcmCatmCggAttmCgaTttGtgGc 

CECAT_Y54G11A.5b_u137_LNA3 GaamCttTgcAggGccGctmCggGgaAtgTcaTgaTttmCatTatTaaGggAa 

CECAT_Y54G11A.5b_u189_LNA3 GtcAatTctGggAgaAggTgtTggAtamCcgGggmCtcGggAgaGaaTgtGc 

CECC_C03D6.3_u275_LNA3 AtgTaaAgaAggAatGctTccmCgaAtgGatTggAtaTttAttTgtmCcaGa 

CECC_C03D6.3_u430_LNA3 GgamCcgAaaTttGtgmCagmCatGtcGgamCacGaaAttGatGglmCtcAttTt 

CECC_C07G2.3_d9_LNA3 mCagAcamCgaAggTtamCgaTagAtaAccAtcTctmCaaAgtmCtaTcgAccTc 

CECC_C07G2.3_u44_LNA3 mCgamCgaTgtGcgTgtTccTgamCgaTgaAagAatGggAtaTtaAgaAaamCc 

CECC_Y46G5A.2_u331_LNA3 TtgTgcTccAtcGctGctmCcgmCttAcaGacTtgAcaAcgmCtcAccTttGc 

CECC_Y46G5A.2_u385_LNA3 AatGagmCggTtgTgcmCgtGtgAcgTcamCttmCgtmCacAgtGttGctmCtamCt 

CECoA_C29F3.1_u316_LNA3 AaaTtgAcamCcaAtcAaaTctGtcTcaTctmCctGagGacmCgtmCaamCttmCg 

CECoA_C29F3.1_u392_LNA3 AatmCttTgtGtamCggAgaTggGgcAaaAggmCagmCaaGaaAgtAaamCcaAg 

CECoA_F08A8,4_u1094_LNA3 AggAcaAggGgcActActGgcAcaGgcTttGatTatTgcAgtGagAtaTt 

CECoA_F08A8.4_u1260_LNA3 TtaAtgGagGtgAcaAlgGgtTccTtgGatTcgAtaAatTccGagTgcmCc 
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Oligo Name 

CECoA_F59F4.1_u1 09_LNA3 
CECoA_F59F4.1_u424_LNA3 
CECoA_Y25C 1 A. 1 3_u 1 1 5_LN A3 
CECoA_Y25C1A.13_u451_LNA3 

CECOL_C27H5.5__u493_LNA3 

CECOL_C27H5.5_u680_LNA3 

CECOQ_ZC395.2_u199_LNA3 

CECOQ_ZC395.2_u400_LNA3 

CECRYZ_F39B2.3_u171_LNA3 

CECRYZ_F39B2.3_u222_LNA3 

CECyclin_R02F2.1 a_u24_LNA3 

CECyclin_R02F2.1a_u312_LNA3 

CECyclin_ZC168.4_u203_LNA3 

CECycIin_ZC1 68.4_u273_LNA3 

CECYP_B021 3. 1 5_u1 33_LNA3 

CECYP_B021 3.1 5_u202_LNA3 

CECYP_B0304.3__u38_LNA3 

CECYP_B0304.3_u89_LNA3 

CECYP_C03G6.14_u706_LNA3 

CECYP_C03G6.14_u768„LNA3 

CECYP_C03G6. 1 5_d9_LNA3 

CECYP_C03G6. 1 5_u 1 48_LNA3 

CECYP_C06B3.3_u102_LNA3 

CECYP_C06B3.3_u474_LNA3 

CECYP_C12D5.7_u399_LNA3 

CECYP_C12D5.7_u65_LNA3 

CECYP_C45H4. 1 7_u27_LNA3 

CECYP_C45H4.17_u598_LNA3 

CECYP_.C45H4.2_u1 1 0_LNA3 

CECYP_C45H4.2_u429_LNA3 

CECYP_C49C8.4_u363_LNA3 

CECYP_C49C8.4_u883_LNA3 

CECYP_C49G7.8_d6_LNA3 

CECYP_C49G7.8_u795_LNA3 

CECYP_F01 D5.9_u374_LNA3 

CECYP^FOI D5.9_.u46_LNA3 

CECYP_F08F3.7_u25_.LNA3 


Sequence 

GctmCttmCtcmCagTggGctmCaaAatAgtmCaamCtcAacAgaTcgGaaGttmCt 

AaaGctTcgAgaTggmCacGttmCgtmCtgTatmCtcGtgAagAacTtaTtgmCa 

GatTcgmCtgAacTttAtcAagAcgTggAatAtgAgcmCagmCtcmCtgTcgAc 

GatmCttAtcAccGcgTgcGatAttmCgaGtaGctTcamCagGatGcgAttTt 

GgaAagGaaGgaTccAttmCtcAgcTctGcamCttmCcamCcaTcaGagmCcaTg 

TggAtamCaaGgaGggAtcTggmCagTggTggAtcTggAagTggTggAtaTg 

TtgAaaGaamCtcmCttGccGacGatmCctGaaAcamCacAaaGaaTtgmCtgAa 

AtgTggGatGagGagAaaGaamCatTtaGatAcaAtgGaaAgaTtaGctGc 

AggmCtgAgcTctTggActTtgGcaTcaAcaTtgTctmCatTctTgaAggAa 

TtaTggTtamCagAagGagmCtgTttAcgGtgTagmCatTggGaaTgtmCttmCc 

mCacTtcAacmCaamCtcmCgtGttAatmCaaGcaAgcmCgcmCacmCatmCtaAtg 

Ag 

TctmCatTgcTcgTcgAggmCtamCcaAcaAacActGgcAatAccmCaaTtaAt 

TaaGaaAgtmCatTgaGgaTgcTgtmCgcTttGctmCgcmCgaAgtmCtcGtaTa 

AagTtcAtcmCtgTtgAcgGaaTcgAggmCggAgaAtgmCtgTatmCggTcaTt 

AcaGgaAatAtgAttTtgGatTtcGatTttGaaTcgGttGgtGctGccmCc 

GctGagmCtgTatTtgGctAgtGaaAtgTgtGttTttGatActTtaAatGa 

AcgAggTttGgaTcamCaaTcaGaaTtcTgtGaaAtaAgcGttTttTggGa 

AgtTctmCggTctAacAgtGtcTccmCgtTgaAtaTtcTtgTaaAatmCacAc 

AtgAccActmCaaAatActGctAaaAgaTttGcaGcgGcaGaaGccGttAa 

TtgAtaTggmCtgTacmCtgTatGgtTttTgaGgamCgtTttTtaGgaGtcGa 

AttTatTcaTtcAtcmCatGtaAacTgtAtaTttTgaAttTgtGttGtaAa 

GccAaaGcaGaaTtgTatTtgAtcTtcGgtAacmCttmCtcmCttmCgcTacAa 

AttTtgAatmCttmCtgGgaAaaTgcmCatmCcamCtcGagAaamCcgTtcmCgtTt 

mCtaAcgGagGatmCtcGccAatTatmCttTgaGagAcaAaamCtgAaamCtcmCt 

AtcTagTccmCaaTgaAtcTccmCacAtgmCtgTtamCtcGtgAtgTtcAacTc 

TttTgcTttmCatmCgcAaaAgcTcaAgaTtamCacAtgTcaGgtmCaaGccAa 

mCcgmCgamCttTaaAgaGaaGatmCatAaaTttGcaTtgTttTttGttTgtAt 

mCgaGggTgaTtcGgaGacTttmCagTaaTgtmCcaActTtcAaaTgtTtgmCa 

TagAtamCaaGatAcaTccmCtcAaaAgaAggmCctAccGtcAatGgcmCaaAg 

TcaAcgmCgtmCtaTaaAtgAatmCacAacGagGtaTcaAcaTtcTccmCccTg 

AtgmCtgAtgTtgAaaTtgmCtgGctAccGtaTtcmCaaAagAtamCtgTaaTc 

AtgAatmCcaTggmCttGgamCatmCtcmCcgTttTtcAagGgaTatAaaAatGt 

AtgmCaamCgaAttAgtGaaAaaTtcAtcmCtgGaaTaaAaaAtaAttmCtaAa 

AtcGctAcgAcaAtcTttmCcgAtgmCctTcgAagTttmCgaAagmCttTctmCt 

GagGtcGgtGgaGgaGgaAgtGgaAatTgamCggmCaaAatmCctGccmCaaGg 

mCccTctTtgGgaTttmCcamCtcAagTttActGttmCggmCagmCagTgaTatAa 

GagTtgGttmCcamCagAatGctTagGacGttTaaAttmCgtmCacAaamCttTt 
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Oligo Name 

CECYP_F08F3.7_u401_LNA3 

CECYP_F14F7.2_u397_LNA3 

CECYP_F14F7.2_u68_LNA3 

CECYP_F42A9.5_u435_LNA3 

CECYP_F42A9.5_u55_LNA3 

CECYP_K07C6.3_u3_LNA3 

CECYP_K07C6,3_u354_LNA3 

CECYP_K07C6.4_u1 18_LNA3 

CECYP_K07C6.4_u87_LNA3 

CECYP_K07C6.5_u7_LNA3 

CECYP_K07C6.5_u99_LNA3 

CECYP_K09A1 1 .3_u362_,LNA3 

CECYP_K09A1 1 .3_u48_LNA3 

CECYP_K09A1 1 .4_u238_LNA3 

CECYP_K09A1 1 .4_u68_LNA3 

CECYP_K09D9.2_u151_LNA3 

CECYP_K09D9.2_u866_LNA3 

CECYP_T1 0B9.1 0_u41 O^LNA 

CECYP_T10B9.10_u56_LNA 

CECYP_T10B9.7_u102_LNA3 

CECYP_T10B9.7_u267_LNA3 

CECYP_T1 9B 1 0. 1 _u 1 00_LN A3 

CECYP_T1 981 0. 1_u31 9_LNA3 

CECYP_Y49C4A.9_u121_LNA3 

CECYP_Y49C4A.9_u41 3_LNA3 

CECYP_ZK1 77.5_u394_LNA3 

CECYP_ZK1 77.5_u445_LNA3 

CEDAO_C47A1 0.5_d9_LN A3 

CEDAO_C47A10.5_u269_LNA3 

CEDC_C01A2.3_u373_LNA3 

CEDC_C01A2.3_u96_LNA4 

CEDC_C34F6.1_u301_LNA3 

CEDC_C34F6.1_u450_LNA3 

CEDC_F33D11.3_u126_LNA3 

CEDC_F33D1 1 .3_u14_LNA3 

CEDC,F46E10.2_u392_LNA3 


Sequence 

inCaaTatGgtTccmCatTttAgcAacTcaTatGaamCacAgaAgaTgtmCctTg 

GaaAaaGgcGtcGacAttTtaTgtGacAcgTggAcamCttmCacTatGacAa 

TaaTtgAatTacGggTctTttGtamCatAttAatTttAgtAtamCttTgtGa 

AtaTcaAtgmCaamCtaTtaAtgAatmCacAacGtcTtgmCcaAtcTtcTccmCg 

GgaGtgActAtgAaaGcaAagAgtTacmCgaTtgAaamCtgAaaGacAgamCa 

AatmCttTaaTgaTaaTtlAtgGgaTctGtaTttmCtcTttmCtgTcaAtaAa 

AtgAgcmCcamCaaAtgTaaAagGatAcgAgaTtgAttmCggGaamCagTcaTg 

AtcmCtgmCgaTatGacAttAagmCcamCatGgtTctGaamCctTcaAcaGaaGa 

mCtgAacmCttmCaamCagAagAtaAacTtcmCgtAtaGcgmCtgGaaAaamCtcmC 

t 

AttTaaAggAatTcamCagmCtcAaaAaaTaaTaamCtamCcgGttmCagAgaTt 

AatTtgAgcmCacAtgGcaAgtTatmCaamCagAggAgamCaaTgcmCgtAcaGt 

TgamCatTctActTaaAggGaaGaaAatAccAacTggTacmCctTgtAttTg 

TcamCcamCaaAgcmCatAcaTatGcgAgcTagTtcmCtcAggmCtgmCttAaamCc 

TtcGacAaaActAttTtgGaaAgaAcaAtcmCcaTtcAgtGtcGgcAaamCg 

TctGacAacAaaGccAtamCacGtgmCcgActAatTccAcaAtcAgcTagAa 

TtgGcaAaaGcaGaaTtgTatTtaAtcTttGgaAacmCtcmCttmCttmCgcTa 

TgaAtcTttmCaaActTatmCacTccTttTaaTacTacmCgtTccTgtTtgGa 

AttGagAttGtaTccAttGgcGtcTctTgtTcamCaaTcgAaaAtgTctmCa 

AacTgcTacTatTgcGccAtcAagTgtGctGctmCaaActTaaAtcmCagGt 

TtgAgamCagGaaAtaAgamCtaGaaTtcmCttTgaAacTggTggGaaGtgmCt 

AagAtgTcaAagAatTcaAgcmCagAacGatGgtmCcamCcgAcgAgcmCatTa 

AttGaamCcaActmCtgAaaTatAatGacAcaAaamCcaTgtmCtgGaaGtgGt 

GgcAatGtgAcaAtaTctmCcaAtgGttmCttmCacAgcAatmCatmCacGtgTt 

mCtaTtcAatmCgaTatTttAtcAcamCcaTccAgtGctGgamCctmCcaTcaTt 

GtcTcaGagAtgTgtAaaTttActTccmCtgmCaaTttGttTcamCgcAacTa 

TtcmCgaAtgTttmCcaAttGggActGaaGttTcaAgaGtcAccmCagAaaAa 

GatmCcaGcaTctTccAagmCttAcaTtcmCtcmCgtGctTgtAtcAagGaaAc 

TttGaaAacmCtgTttTatTatTaaAatAgaTaaTtgAttAgtTctGtamCg 

AtamCgtTgcActGcaTccGgcTatGagGgaGccAaaAatmCtlAggGgaGt 

GcamCttmCcaTtcAtcTctGcaGctActAtgGctTtgGtgAcaAaaGttGg 

mCcgTccAaaAgaAtgmCcaTctmCacAagTctTgaAatmCttAtaAagGtaGt 

GagGgaTcaAcaGtaAccTcgTgcGgtAttGacAagGgaTgtmCcgGaaGg 

GatGgtTctTcgAtcGcaAacAaaAcaGatGtgmCtcmCatTtamCatAcgGa 

AtgGagAaaAtgGalmCtgAtgGagTtgmCagGaaGtgAtgGagmCtcmCagGa 

TgaAtcTccAtaAatTatTcaAtgTttmCcaAatAttTaaTttAtcAatTg 

GctmCaamCacGgtAggAtcmCtaTggAacmCgtmCggAggAgcAggmCctmCggA 

9 
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Oligo Name 

CEDC_F46E1 0.2_u54_LNA3 

CEDC_F56G4.2_u382_LNA3 

CEDC_F56G4.2_u82_LNA3 

CEDC_M162.2_u103_LNA3 

CEDC_M162.2_u480_LNA3 

CEDC_R10E4.1 1_u274_LNA3 

CEDC_R10E4.1 1_u397_LNA3 

CEDC_T04C9.1_u321_LNA3 

CEDC_T04C9.1_u64_LNA3 

CEDC_W02A2.3_u32_LNA3 

CEDC_W02A2.3_u374_LNA3 

CEDC-_W05G1 1 .3_u153_LNA3 
CEDC_W05G1 1 .3_u51_LNA3 
CEDC_ZK863.5_u256_LNA3 
CEDC_ZK863.5_u324_LNA3 
CEEPHX_Y55B1BR.4_u161_LNA3 
CEEPHX_Y55B1 BR.4„u93_LNA3 
CEER_18S_u388_LNA3 
CEER_18S_u82_LNA3 
CEER_26S_u342_LNA3 
CEER_26S_u38_LNA3 
CEFOXO_R13H8.1b_u331_LNA3 
CEFOXO_R13H8.1b_u393_LNA3 
CEGAPDH_K10B3.7_u21_LNA3 
CEGAPDH_K1 0B3.7_u727_LNA3 
CEGBA_F1 1 E6.1a_u232_LNA3 
CEGBA_F1 1 EG. 1 a_u451_LNA3 
CEGLU_C02A12.1_u264_LNA3 
CEGLU_C02A1 2.1_u55_LNA3 
CEGLU_C46F1 1 .2_u271_LNA3 
CEGLU_C46F1 1 .2_u45_LNA3 
CEGLU_F26E4.12_u109_LNA3 
CEGLU_F26E4.12_u480_LNA3 
CEGLU_R07B1 .4_u166_LNA3 
CEGLU_R07B1 .4_,u38_LNA3 
CEGLU_T09A1 2.2_u220_LNA3 


Sequence 

mCgtGacAacmCtcTtaTttAttTctGtaAaamCtgAttmCgcmCaaActTttGt 

GaaGctTtcAaamCcaAatGagTtcmCttmCccGgaAtcmCcaAagAatAccAa 

AcaAtgAaaAgaGagGatGgaAagGaaAtcGaaGtcTctGttmCttGacGa 

GatGagGtamCatAacTttGtgTgcAgtTatAggmCcaTctAcaGtamCctGc 

TtcmCatmCatmCacTaamCcgAttGtcmCtgAcaTtgAtgGccAaamCcaGggAa 

TcamCatTatmCgaAcaAgtActAgtAagmCatGctGtgAtgGagTgcmCgcTa 

mCacGgaGatmCacGacAtcAaaGcgGatTgcTtaGagTgtGgaAacmCgtmCt 

ActAtcTacGtgGcamCgtTggActmCatmCatmCgaTggGaamCgamCgtAtaAg 

TctmCtgGccAgtTcamCttTgtGatmCaaTctmCagAttmCgtmCcamCacAagAt 

mCtamCttmCcgmCaaGaaGgcmCcgTcgTttmCtaAtcGatmCgaAcaTctmCacA 

c 

AtgGatGatmCgamCccActTgcmCacTgamCccAcaAtcmCcgmCacTcamCtam 
Cc 

AagAcgGagAggmCtgGagAgaAcgGtamCcgAtgGagAgcmCagGaamCtgAt 

mCcamCccAggAggAggGatAcaAgaGaaGaaAgtAcaGatTctmCcaActAa 

AgtTtcAcamCttmCttTttGccGttTtgGttmCccGttAtcAatmCcaTtgAt 

mCttTtaTatTctmCatmCaaTttGttTccTacTtgGtcAgcTgaGgaTcgTt 

TtcGgcAcaAatGgaGcaAaaGtaTcgTggTtaTtgTgaTgcGatTatTc 

mCtamCtaTgaAtgAgcTcamCtgGacTcaTttAtcAacTcgAgtmCaaAagmCc 

GttGgcGaaTctTcgGgtTcgTatAacTtcTtaGagGgaTaaGcgGtgTt 

GaamCtgAttmCgaGaaGagTggGgamCtgTcgmCttmCgaGgtTtaAcgActTc 

TgtTatTgcGaaAgtAatmCctGctTagTacGagAggAacAgcGggTtcAa 

TgcAtamCgamCttGgtmCtcTtgGtcAagGtgTtgTatTcaGtaGagmCagTc 

TgtGctmCagAatmCcamCttmCttmCgaAatmCcaAttGtgmCcaAgcActAacTt 

TtaAgamCggAacmCaaTtgmCtcmCacmCacmCatmCatAccAcgAgtTgaAcaGt 

AcaTtgmCtamCcaAggmCctAagmCcgmCttmCaaAttmCtcTaaGtcTgaAatGa 

GttGagTccAccGgaGtcTtcAccAccAtcGagAagGccAatGctmCacTt 

AgtAaaTtcmCttmCcamCgtGgaTctActmCgtGtgTtcAcaAagAtcGagGg 

GgtmCcaAtaAtgGgaGacTggTtcmCgcGcaGaaAgtTatGcaGatGatAt 

AgaAaamCttmCgtTggAccmCtgmCtaAggAgaAgtAttTcaAgcTtcTgaGc 

GagmCacmCcgAagmCtcAagmCcaTatTtgGaaAcaAgamCcaTacTctTcaAa 

GttAccmCtcTacAaaTctmCgcTtcAatmCcaAtgTtgTtcGcaGtcAccAa 

mCcgAagAgcTcgTtamCtaTgcGagGagGtgTgaAgcmCggAatAatTttTt 

AagTtcTtgGttGgamCgcGatGggAaaAttAtcAagAgaTttGgamCcaAc 

AcgAttTcaAcgTcaAaaAtgmCtaAtgGtgAtgAcgTgtmCacTttmCggAt 

AccTggGttGatGttTttGcgGctGaaAgtTtcTccAagmCtcAttGatTa 

GaaGtamCgtmCtcmCcaAagAaaAgcTacmCccAgcTtaAggmCatTgcAcaAt 

GcgmCcaGatAtgTatTcaAagAtcGagGtaAatGgtmCagAacActmCalmCc 
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Oligo Name 

CEGLU„T09A1 2.2_u335_LNA3 

CEGLU_T28A1 1 .1 1_u299_LNA3 

CEGLU_T28A1 1 .1 1_u54_LNA3 

CEGPD_B0035.5_u256_LNA3 

CEGPD_B0035.5_u478_LNA3 

CEHSP_C09B8.6_d8_LNA3 

CEHSP_C09B8.6_u286_LNA3 

CEHSP_C12C8.1_u127_LNA3 

CEHSP_C12C8.1_u1531_LNA3 

CEHSP_C47E8.5_u310_LNA3 

CEHSP_C47E8.5_u361_LNA3 

CEHSP_F26D10.3_u276_LNA3 

CEHSP_F26D1 0.3_u397_LNA3 

CEHSP_F43D9.4_u1 69_LNA3 

CEHSP_F43D9.4_u275_LNA3 

CEHSP_F44E5.4/5_u1 23_LNA3 

CEHSP_F44E5.4/5_u380_LNA3 

CEHSP_F52E1 .7_u175_LNA3 

CEHSP_F52E1 .7_u448_LNA3 

CEHSP_F54D5.8_u252_LNA3 

CEHSP_F54D5.8_u318_LNA3 

CEHUS_H26D21.1_u117_LNA3 

CEHUS_H26D21 .1_u478_LNA3 

CEMRE_ZC302,1_u169_LNA3 

CEMRE__ZC302.1_u292_LNA3 

CEMTL_T08G5.10_d127_LNA3 

CEMTL_T08G5.1 0_u45_LNA3 

CENAP_D2096.8_u356_LNA3 

CENAP_D2096.8_u70_LNA3 

CEPALF56D12.5_u241_LNA3 

CEPAI_F56D12.5_u301_LNA3 

CEPDLC07A12.4_u28_LNA3 

CEPDLC07A1 2.4_u433_LNA3 

CEPDLC14B1.1_u119_LNA3 
CEPDI_C14B1.1_u358_LNA3 
CEPGK_.T03F1 .3_d9_LNA3 


Sequence 

AatmCtamCagGgaAaaAggAttTcgAgtTgcmCgcGttTccAtgmCaaTcaAt 

AgaTggmCaaAgaAgcAtamCatAacTgaAacTctTccmCggGgaGctActAc 

TgaAtaAacGggmCcgAacTaaAtcmCatTcgTcaGtgGaaAtgGgaAacAa 

GtcmCglmCttmCctGatGctTatGaamCgcmCtaTttmCtcGaaGtaTtcAtgGg 

TgtGgaAaaGctmCtcAacGagAagAaaGcaGaaGttmCgtAtamCaaltcAa 

AtaTcgmCcgmCctGctTccTcamCcaAccmCgaAtaAcgmCaarriCaaAaamCttTa 

AagAgcmCcamCtcAtcAagGatGaaAgtGatGgaAagActmCttmCgtmCtcAg 

mCaaGatAttTtaAcaAaaAtgmCatmCaamCaaGaaGccmCaaTcaGgtTccGg 

mCttGggmCatTctGtamCggGatGctGtcAttActGtgmCctGcaTatTttAa 

AagAagmCatmCtcGaaAtcAacmCcaGacmCacGctAtcAtgAagAcamCttmCg 

AtgAaaGctmCaaGctmCttmCgtGatTccTctActAtgGgaTacAtgGccGc 

TtaAgcAgamCcaTtgAggAcgAgaAgcTcaAggAtaAgaTcaGccmCagAa 

mCgtmCttTccAagGatGacAUGaamCgcAtgGtcAacGaaGctGagAaaTa 

GtcGacTtgGctmCacAtcmCacAccGtcAtcAacAagGaaGgamCagAtgAc 

mCaaTctTgaGggAcamCgtTctmCacmCatTgaGggAcamCcamCgaGgtmCaa 

Ga 

TcamCtaAaaTgcAccAatmCtgGacAatmCttmCtgmCttmCtgmCtgGatGcgmCt 

TcaTgaAgcTaaAcaAttmCgaAaaGgaAgaTggTgaAcaAcgGgaAcgTg 

AagTatAacmCttmCcaAcaGggGtcmCgtmCcaGaamCaaAtcAagTccGaaTt 

TttAacmCatGgcmCgcAgaTtcTtcGatGacGtcGacTttGatmCgcmCacAt 

GcgTcgAaaAgaTctmCccTgaAgtmCtgmCatTgamCtgGccTtgAtaTtaTg 

AcaTagTctTcgTcaTcaAggAtaAgcrhCacAccmCgaAatTcaAgcGagAg 

TcgmCcaAcamCtcGgamCacGtgmCcaAaaTgaAtaTcaTctmCaaAtcGaaTg 

GtcGaaGttAgaAatmCcaGaaGccGatAttGttTclmCatmCaaAttmCcaAt 

ActActmCgtGgaAgaTccAatAaaGttGttTcaAcgmCgamCaaAtcGatTc 

GgcAgtGaaGatGaaGtgGcaAatTctGatGaaGaaAtgGgaAgcAgtAt 

TtgTcaAcgAccAgaAgcAaaAatTatGggAatmCgcGatAaaAtlmCaaGg 

GatGcaAgtGtgmCcaActGcgAatGtgmCtcAggmCtgmCtcAttAatTtgAa 

GacGatAtgTtcGatTtcmCcaGgaGagGacGgtGatGatGtgTcaGacTt 

GacGatAtgTtcGatTtcmCcaGgaGagGacGgtGatGatGtgTcaGacTt 

GagGtcGtcGtaAtcmCacAagGctmCcaAgaAagmCaaGtgmCtcGacAttTc 

GatActTttGgcAagmCtcGttmCcaAtcAagAagGagGtcAtcmCcaGatmCg 

GatGagGagGgamCacAccGagmCtcTaaAtcmCacAttmCcaAtamCagTtcAa 

mCttAtglccGaaGatAtcmCcaGagGatTggGacAagAacmCcaGtcAagAt 

TacmCccAgtmCgamCtaTgaTggAgamCagAaamCctmCgaGaaGttmCgaAgaA 

t 

mCtcGtcGccTccAacTtcAacGaaAttGccmCttGatGaaAccAagActGt 
TtcTatTgtTtaTtcmCttGccmCaaTagTgtAttTgtAttTatTctTtcTc 
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Oligo Name 

CEPGK_T03F1 .3_u424_LNA3 
CEPON_E01 A2.7_u223_LNA3 
CEPON_E01 A2.7_u79_LNA3 

CEPPGB_F13D12.6_u44_LNA3 

CEPPGB_F13D12.6_u440_LNA3 

CEPPS_T14G10.1__d2__LNA3 

CEPPS_T14G10.1_u240_LNA3 

CEPRDX_R07E5.2__u405_LNA3 

CEPRDX_R07E5.2_u42_LNA3 

CEPYC_D2023.2_u256_LNA3 

CEPYC_D2023.2_u427_LNA3 

CERAD_F10G7.4_u169_LNA3 

CERAD_F10G7.4_u267_LNA3 
CERAD_F32A1 1 .2_u250_LNA3 
CERAD_F32A1 1 .2_u380__LNA3 
CERAD_T04H1 .4_u274_LNA3 
CERAD_T04H1 .4_u375_LNA3 
CERAD_W06D4.6_u325_LNA3 
CERAD_W06D4.6_u34_LNA3 
CERAD_Y1 1 6A8C. 1 3_u289_LNA3 
CERAD_Y1 1 6A8C. 1 3_u59_LNA3 
CERAD_Y39A1A.23_u221_LNA3 
CERAD_Y39A1A.23_u276_LNA3 
CERAD_Y4 1 C4A. 1 4_u509_LNA3 
CERAD_Y41C4A.14_u731_LNA3 
CERAD_Y43C5A.6_u131_LNA3 
CERAD_Y43C5A.6_u429_LNA3 
CERFC_F31 E3.3_u1 28_LNA3 
CERFC_F31 E3.3_u55_LNA3 
CERPL_K1 1 H12.2_d1_LNA3 

CERPL_K1 1 H 1 2.2_u1 72_LNA3 
CERT_F36A4.7_u1396_LNA3 
CERT_F36A4.7_u2302_.LNA3 
CERT_F36A4.7_u289_LNA3 
CERT_F36A4.7_u291 9_LNA3 


Sequence 

iTiCaaAtcmCatmCtcmCcaGtgGatTtcGtcAttGctGacAagTtcGccGagGa 
GttTctGatTcgAcamCttTatGgamCcaTctmCaaGttmCtgmCgaGttTctTt 
GggAaamCaaAtgAttGttGgtAcaGtaGccmCgcmCctGctAttmCacTgtGa 
mCgaGcamCatmCatmCcaAtcGttmCctGttmCaamCaaGgcmCttmCtaAtcGttA 

g 

TgaTgaGagmCccAgtAacmCaaTtaTttGaamCcgTcaGgaTgtGcgTaaGg 

mCgtmCtaAtcGaaGaaGggGatmCgtGggmCaaTcaTaamCtaAttAacmCttmCa 

mCaaTggmCtcmCagGtcTttmCtgmCtcTtcAtaTacTtcrnCatTccGagTtgmCt 

GttmCtcTtgGagmCtgAagTtgTcgmCgtGctmCgtGtgAttmCtcActTctmCt 

TcgmCtamCcaGcaAggAatActTcaAcaAggTcaAcaAgtGatmCacAcaGa 

AagGaaAttGtaActmCgcmCcaAgaGctmCtcmCcaGgtGtcmCgtGgamCatAt 

TtgActGgaTtgGagAttGcgGaaGaaGttGatGttGaaAtcGagAgtGg 

GccAagTctmCaaGcaAtaAgtGttGatmCaaTcaGagmCcaTacGgaGagAt 

AtaTtgAgamCttmCggGacAagmCggAclTctmCatmCtgTcamCagmCaamCtgm 

Cc 

GatmCcgmCagAgaAtcGagTatTtcmCtcTcgAgamCccAtgGatAtcAacTg 

TccGttAagAagmCtcActGgaAaaAcamCacGgcTcgAacGaaAttGgaAt 

AatTtgGatGagAgcAaaGtgGaaGgaAtgGctAtcGttTtgGcaGatAt 

GtgmCtgGtcAaaAaaTgcTtgmCttmCgtTgcTtaTtcGcaTtgmCacTcgmCa 

mCttmCgaGaamCtcTtcAagTtgGaaTcaAcaGtgGcaTcgGatAcamCatGa 

GtgmCctTctGaaGccGaaGaaAacGacGatTagTtaAatGttTccAagTt 

GatAaaAtcGatAgcGacGacGatGagGaaGccGatGatGagGagmCtcGa 

GcaGgtGgaTacGgaTgtGgaGctGacTttTgcGttTtaTcaAgaAtcTc 

TccmCgtAgaAgtAgaAatGctAgaAgaAccTgaAcaAgaAgaTcaAgaAa 

TgcAagAtgTcaGtaTtgAaamCaaTtcmCtgTagAgamCccmCcgAagAaaAt 

AgtmCtcGtaTccGggAatGttTcaGccTgtGaaAatGctTgtTgaAgamCg 

mCttmCaaAacmCgtmCgcTttTaaGgaTacAggAacGtgGcamCgcTtcmCgaGg 

mCagAttGtamCctTcgAaaAggAaaAggAgaGaaTcgmCgtmCgcAaaAatGg 

TgaTggmCtlTgaTtaTtcGagmCagGagmCaaTgaTgtmCcgAgaGtcGttAt 

mCaaTgamCgaGaaTatTggAgtAatGggGaaActGgtTgcGacTtgmCgaAa 

TtgGaaAacAatmCtcmCtcGacTttmCtgmCtcActmCttmCgtGaaActAtcmCa 

TctTgtTatTttAttTtgTttTggGctTgtTccGaaAatGaaAtgGttGt 

mCaaTggAtcAccAagmCcaGttmCacAagmCacmCgtGagmCaaAgaGgamCtc 

Ac 

mCttTgtGatGtgAtgActGcgAagGgamCacTtgAtgGctAttAcgAgamCa 
GagmCcaGctActmCagAtgAcamCtcAacAcgTtcmCatTatGcaGgaGttTc 
TacActmCcaTccTcgmCcgAcaTacAatmCcaAcaTctmCcamCgcGgaTtcTc 
AtgGagAagAtgGttTggAtgGaaTgtGggTtgAgaAtcAgaAtaTgcmCg 
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Oligo Name 

CERT_F36A4.7_u4269_LNA3 
CERT_F36A4.7_u5485_LNA3 
CESLC_F52F 1 2. 1 a_u249_LNA3 

C ESLC_F52F 12.1 a_u76_L N A3 
CESLC_K1 1 G9.5_u400_LNA3 
CESLC_K1 1G9.5_u462_LNA3 
CESLC_Y32F6B.1_u179_LNA3 
CESLC_Y32F6B.1_u280_LNA3 
CESLC_Y37A1 C. 1 a_u 1 04_LNA3 
CESLC_Y37A1 C.1 a_u404_LNA3 
CESLC_Y70G10A.3_u383_LNA3 
CESLC_Y70G10A.3_u46_LNA3 
CES0D_C15F1 .7_u435_LNA3 
CESOD_C1 5F1 .7_u9__LNA3 
CES0D__F1 0D1 1 .1_u326_LNA3 
CES0D_F1 0D1 1 . 1_u477_LNA3 
CESULT__EEED8.2_u316_LNA3 
CESULT_EEED8.2_u82_LNA3 
CESULT_Y113G7A.1 1_u252_LNA3 
CESULT_Y1 13G7A.1 1_u96_LNA3 
CESULT_Y67A10A.4_u108_LNA3 
CESULT_Y67A1 0A.4_u327_LNA3 
CET0P0_K1 2D1 2. 1_u398_LNA3 
CETOPO_K12D12.1_u449_LNA3 
CETOPO^MOI E5.5b_u256_LNA3 
CETOPO_M01 E5.5b_u429_LNA3 
CEUbLF25B5.4_u1 86_LNA3 
CEUbi_F25B5.4_u2_LNA3 
CEUbLF29B9.6_u145_LNA3 
CEUbLF29B9.6_u230_LNA3 
CEUbLM7.1_u239_LNA3 
CEUbLM7.1_u53_LNA3 
CEUGT_F39G3.1_u40_LNA3 
CEUGT_F39G3.1_u466_LNA3 
CEUGT_M88.1_u480_LNA3 
CEUGT_M88.1_u72_LNA3 
YAL009W u145 LNA3 


Sequence 

AacmCggGatAccGtgTcgAacGtcAcaTgaAagAtgGcgAtaTaaTcgTc 

GagGagAttAaamCgcAtgTcaGtgGctmCatGtcGagTttmCcaGaaGtcTa 

AgaTatTgcmCtcTacTtaTcaTggGccTgaTggmCttTgtmCtgmCcgGtaTt 

GaaTctmCaamCcamCttmCtgGaamCccmCatAcamCcaAtgGatAgaAgamCgg 

Ag 

GttGttmCttTttTccGtgAtcTttTcaTgtTtaTgtmCtgAacGtgGcaGg 

GacTcgTtgGtgTctTgcTagGatGtcTtgGgtTcaTtcmCtcAatmCgtTg 

GtamCtgGgcTcgAggGctGaaActAatmCgaAgaAgaAacTccAgaAgaTa 

GgaTcaTgcTctGttTacGacActGatGagTtaAgaGtcAgamCtgmCacGt 

mCgaTggTtcTtcTcgTctAtcAtaTcgGggTagTtgmCcgAagTgtTgaAa 

mCaaAtcGaamCtgGtalaaAggAggAccGacGgaGacGaaTttGaamCgaGa 

AttmCgaTcaAagAacTctGgcTctmCggmCgtTaamCtgGacAttTgtTcgTc 

mCtcmCccGagmCagGcgAttAttmCacGctAgtTatGctmCaaAtgTgaTctGt 

mCcgGtamCtaTctGgaTcamCacAgaAgtmCcgAaaAtgAccAggmCagTtaTt 

mCccAgtGacTacmCtgAatmCgcGtcTctGaaTctmCcamCacAatTccTacTa 

GgaGttGctmCacmCgcAatTaaGagmCgamCttmCggAtcTctGgaTaaTctTc 

AaaTtgAggAaaAgcTtcAcgAggmCggTctmCcaAagGaaAcgTcaAagAa 

mCaaTcgTacmCatGaaAgaAgtTggAagmCcamCgtGcaAgaGaaGaaAtcmCa 

AagAagAttmCctGacmCagAgaGacTcamCgtGctTacmCcaAgaAgcAtcTa 

AgcAttGgtGgaAatAcgAaalggmCatGggAagAgaAacmCccTctmCaaTt 

mCtgGttAcgGtaGtgTatGgtmCccTgtmCctmCtcAgaAtgmCaaAtaTgtmCg 

TctAcgTcgAtgGaaAagmCcgAttTaamCaaTcaAagmCcaAcaAcgmCagTt 

GgaAagGtgmCcaAaaAgtTgamCagmCaaltgGagGatmCttAttmCatTgcmCa 

AgaTgaTgaTgaAgtTccTgcAaaGaaGccTgcTccAgcGaaGaaAgcTg 

AaaAccTcgTacTggAaaAggAgcTgcGaaAgcGgaAgtTatmCgaTttGt 

GagAagGccmCagAagAagTacGacAgamCtgAagGagmCagTtgAaaAagTt 

TtcTgtmCatAcaAtcGtgmCtaAtcGgcAggTtgmCgaTccTttGtaAccAt 

AagmCttmCggAcamCcaTtgAgaAtgTcaAagmCcaAaaTccAggAtaAggAg 

AatmCgaAccmCatmCaaTtcActmCgtTatTccTccTcgAtcTccGttmCaaGt 

mCtgAacmCatmCcaAatAttGaaGatmCcaGctmCagGctGaaGccTatmCagAt 

mCgtGtgmCttAtcTctTctGgaTgaAaamCaaGgaTtgGaaGccGtcAatmCt 

mCggAagmCatmCtgmCctTgamCatTctmCcgTtcGcaGtgGtcGccGgcTctG 

AaaGtamCgcTatGtgAggAggmCtaAcamCcaTtcAtaTaaGaamCgcAgcmCa 

TgtTgcmCgtAgaAgaGagActAaaActAagAacGatTgaTtgAagGtcTg 

TacAatTctTtgmCagGaaGcaAtaTccGccGgaGtcmCccmCttAtcActAt 

mCtcAcgGagGttAtaAttmCtaTgcAggAggmCaaTttmCtgmCtgGagTtcmCa 

AccGttTcaTgaGagmCtgTaaTcaGgtGttGttTctGtaAaaAgtGtgAa 

GtgGatGtgAaaTtaGtcmCtcAacmCccAgaGcaTttAgtGcaGagAttAg 
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Oligo Name 

YAL009W_u341_LNA3 

YAL059W_u262,LNA3 

YAL059W_u51_LNA3 

YER109C_u109_LNA3 

YER109C_u436_,LNA3 

YHR152W_u128_LNA3 

YHR152W_u510_LNA3 

YKL130C„u211_,LNA3 

YKL130C,u85_LNA3 

YKL178C_u199_LNA3 

YKL178C_u367_LNA3 

YLR443W_,u179_LNA3 

YLR443W_u86_LNA3 

YOR092W_u251_LNA3 

YOR092W_u82_LNA3 

YPL263C_u132_LNA3 
YPL263C u257 LNA3 


Sequence 

GcaGtlTaaTgtGaaGctAgtTaaAgtAcaGtcTacGtgGgamCgaGaaAt 

AttGccAaglccAttTctmCgtGccAagTacAttmCaaAatAcaAgaAagGc 

AgamCtcmCtamCaaAtaGatTcgGtgTccTgcmCagAcgAtgTtgAagAatAg 

TtgAagTttGggAatAttGgtAtgGttGaaGacmCaaGgamCcgGatTacGa 

GagGcgmCaaGtaGgcAatGatTcaAgaAgtAgtAaaGgcAatmCgtAacAc 

TgaGcamCaaAgtTaaGatGttmCggAaaGaaAaaGaaAgtmCaaTccTatGa 

mCaaGlgAccAatmCagmCacGcamCggmCttmCcaTccTcaAgamCtgAtaTtam 

Cc 

AttAaaTgcGcaGatGagGacGgaAcgAatAtcGgaGaaActGatAatAt 

GatGgtAagmCtgAgcGccTtgGacGaaGaaTttGatGttGtcGctActAa 

TacGtcAcgmCaaGgamCagAgcTttGacGacGaaAtaTcamCttGgaGgalt 

TctmCccTgtGtaGgtAcamCcaAtaTcamCaaGcgmCatTtcTatGtcGacTa 

TgcTaamCacmCagTttAgamCcaTggAaaTccmCacmCgcAaaTatAagmCaaTg 

GcaGgamCatAagAttmCcgGtcAagmCaamCgamCagTgaAgaAaglatGcaAa 

mCcgTctAgtGaaAgcGggAtgGctAaaTtgGgaAaamCgamCaaGatGttAt 

GatGctTcaAtaTccTttGatGgtmCgtTagTttAccAttTttGgtGtcTt 

iCatTtgAgtTatGtgAagAccGttGgtGggAaaGaaGagAtcAggTg 

GtcTtgGctAccAcamCccAaaAccGttmCgaAacTttAagAgcAttmCtamCt 


Example 9. Performance Analysis of LNA Oligonucleotide Capture Probes Designed to 
Detect Ratios of Splice Variants in mRNA Pools. 

Oligonucleotide Design for Microarrays. 

The methods for designing exon-specific internal oHgonucleotide capture probes are 
described above. 

Design of the LNA-modified Capture Probes 

For the internal LNA-modified oligonucleotide capture probes, every third DNA 
nucleotide was substituted with an LNA nucleotide. The probes designed to capture the junction . 
of the recombinant splice variants were designed with LNA modifications in a block of five 
consecutive LNAs nucleotides, two on the 5' side of the splice junction and three on the 3' side 
of the splice junction. All capture probes are shown in Table 14. 
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Table 14. Internal, exon-specific and merged, exon-exon junction specific oligonucleotide 
capture probes used in this example. (SEQ ID NOs: ) 


Capture probes 


Sequence (LNA=uppercase, DNA lowercase letters) 

mCctGaaAgtAgaTttGttAttTccGaaAcgmCctTctmCccGttmCttAagTc 
mCatAtamCcamCaaAtaGtcmCctmCaaAaaTcamCaaGaaAacTcamCaamCacTg 
GatTtgmCagmCggTggTaaAaaGtaTgaAaamCgtGgtAatTaaAagGtcTc 
mCcaAtgAaaActAatmCaaAggTaaAcgTggAtcmCcaTggmCaaTtcmCcgGg 


gene78.01a_50_LNA3 
gene78.01b_50_LNA3 
gene78.03a^50_LNA3 
gene78.03b^50_LNA3 


gene78.m01INS3_50_block caacactgcccagaggttcaatcGATmCmCgatgatcctaatgaaggcgccc 
gene78.mINS303_50_block gtccagtatcgtccatcatAGTATcgataaatatgtgaaggaaatgcctg 
gene78.m01INS4_50_block caacactgcccagaggttcaatcGATGTgtgataggatcagtgttcaggg 
gene78.mINS403_50_block gaaggcgaaggagactgctAATATcgataaatatgtgaaggaaatgcctg 

Printing and Coupling of the Splice Isoform-specific Microarrays 
5 The splice variant capture probes were synthesized with a 5' anthraquinone (AQ)- 

modification, followed by a hexaethyleneglycol-2 (HEG2) linker. The capture probes were first 
diluted to a 20 \\M final concentration in 100 mM Na-phosphate buffer pH 7.0, and spotted on 
the Immobilizer polymer microarray slides (Exiqon, Denmark) using the Biochip Arrayer One 
(Packard Biochip Technologies, USA) with a spot volume of 2x 300 pi and 300 nm between the 
10 spots. The capture probes were immobilized onto the microarray slide by UV irradiation in a 
Stratalinker with 2300 njoules (Stratagene, USA). Non-immobilized capture probe 
oligonucleotides were removed from the slides by washing the slides two times 15 minutes in 
IxSSC. After washing, the slides were dried by centrifugation at lOOOxg for 2 minutes, and 
stored in a slide box until microarray hybridization. 


Construction of Splice Variant Vectors 

The recombinant splice variant constructs were cloned into the Triampl 8 vector 
(Ambion, USA). The constructs were sequenced to confirm their construction. The plasmid 
clones were transformed into E, coli XLIO-Gold (Stratagene, USA). 
20 Triampl 8/SWI5 Vector Construct 

Genomic DNA was prepared from a wild type standard laboratory strain of 
Saccharomyces cerevisiae using the Nucleon MiY DNA extraction kit (Amersham Biosciences, 
USA) according to the supplier's instructions. Amplification of the partial yeast gene was 


15 
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performed using standard PGR using yeast genomic DNA as template. In the first step of 
amplification, a forward primer containing a restriction enzyme site and a reverse primer 
containing a universal linker sequence were used. In this step, 20 bp was added to the 3 '-end of 
the amplicon, next to the stop codon. In the second step of amplification, the reverse primer was 
5 exchanged with a nested primer containing a poly-T2o tail and a restriction enzyme site. The 
SWI5 amplicon contains 730 bp of the SWI5 ORF plus 20 bp universal linker sequence and a 
poly-A2o tail. 

The PGR primers used were; 
YDR146G-For-EcoRI: acgtgaattcaaatacagacaatgaaggagatga (SEQ ID NO: ) 

10 YDR146C-Rev-Uni: gatccccgggaattgccatgttacctttgattagttttcattggc (SEQ ID NO: ) 
Uni-polyT-BamHI: acgtggatccttttttttttttttttttttgatccccgggaattgccatg, (SEQ ID NO: ) 

The PGR amplicon was cut with the restriction enzymes, EcoRl + BamHh The DNA 
fi-agment was ligated into the pTRlamplS vector (Ambion, USA) using the Quick Ligation Kit 
(New England Biolabs, USA) according to the supplier's instructions and transformed into E. 

1 5 coll DH-5a by standard methods. 

Construction of the Recombinant Splice Variant #1 (Triampl8/swi5-rubisco) 

The Arabidopsis thaliana Rubisco small subunit ssu2b gene fragment (gi 17064721) was 
amplified from genomic DNA by primers named DJ305 5'- 
AGTATGATGGAGGATAGTGGAG-3' (SEQ ID NO: ) and DJ306 5'- 

20 ATTGGATCGATCCGATGATCCTAATGAAGGC-3' (SEQ ID NO: ), containing CM 

restriction site linkers. The purified PGR fi-agment was digested with Glal and then cloned into 
the swi5 (gl:7839148) vector at the unique Clal site (atcgat) giving each insert a flanking 
sequence from the original yeast SWI5 insert (named exonOl and exon 03, Fig. 1 1). The product 
was inserted in the reverse orientation, so that the insert sequence is: 

25 

atcgatGGGATGATGGTAATGAAGGGGGGGGGGTAGTGGTTGTTGGATTCTTCAAGTTCC 
TTCAAGACTTGAGCGGAGTGGGTGCATGCGAAGAATGGAAGCTTCGACATTGTGGAG 
TATGGTGGATGATAGTatcgat (SEQ ID NO: ) 

30 Nucleotide sequence analysis revealed a difference between the sequence of A, thaliana 

rubisco expected from the GenBank database and that obtained from all sequenced constructs 
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and PCR products. Position 30 in the Rubisco insert is "C" rather than the expected "A". This 
SNP was probably created by PCR. None of the oligonucleotide capture probes used in the 
example cover this region. Rubisco sequence in genbank is TCCTAATGAAGGCGCCA (SEQ 
ID NO: ). The sequence obtained from the plasmid contruct is TCCTAATGAAGGCGCCC 

5 (SEQ ID NO: ). 

Construction of the Recombinant Splice variant # 2 (Triampl 8/swi5-lea) 

The Arabidopsis thaliana Lea gene (gi 1526423) was amplified from genomic DNA with 
primers named DJ307 5'-GGAATTATCGATGTGTGATAGGATCAGTGTTCAG-3' (SEQ ID 
NO: ) and DJ308 5'-AATTGGATCGATATTAGCAGTCTCCTTCGCC-3' (SEQ ID NO: ) 

10 including the Clal linker sites as above. The PCR fragment was digested with Clal cloned into 
the yeast SWI5 IVT construct as above at the unique Clal site. The fragment was inserted in the 
forward orientation, resulting in the following insert sequence: 

atcgatGTGTGATAGGTTCAGTGTTCAGGGCTGTCCAAGGAACGTATGAGCATGCGAGA 
GACGCTGTAGTTGGAAAAACCCACGAAGCGGCTGAGTCTACCAAAGAAGGAGCTCA 
15 GATAGCTTCAGAGAAAGCGGTTGGAGCAAAGGACGCAACCGTCGAGAAAGCTAAG 
GAAACCGCTGATTATACTGCGGAGAAGGTGGGTGAGTATAAAGACTATACGGTTGA 
TAAAGCTAAAGAGGCTAAGGACACAACTGCAGAGAAGGCGAAGGAGACTGCTAATa 
tcgat (SEQ ID NO: ). 

20 Preparation of Target 

In vitro RNA Preparation from Splice Variant Vectors 

In vitro RNA from the splice variants were made using the MEGAscript^"^ high yield 
transcription kit according to the manufacturer's instructions (Ambion, USA). The yield of IVT 
RNA was quantified at a Nanodrop spectrophotometer (Nanodrop Technologies, USA). 

25 

Isolation of Total RNA from C elegans 

C elegans wild-type strain (Bristol-N2) was maintained on nematode growth medium 

(NG) plates seeded with Escherichia coli strain OP50 at 20 V, and the mixed stages of the 

nematode were prepared as described in Hope, I. A. (ed.) " C. elegans - A Practical Approach 

30 Oxford University Press 1999. The samples were immediately flash frozen in liquid N2 and 

stored at - 80 °C until RNA isolation. 
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A 100 aliquot of packed C. elegans worms from a mixed stage population was 
homogenized using the FastPrep Biol 01 from Kem-En-Tec for 1 minute, speed 6 followed by 
isolation of total RNA from the extracts using the FastPrep Biol 01 kit (Kem-En-Tec) according 
to the manufacturer's instructions. The eluted total RNA was ethanol precipitated for 24 hours at 
5 -20''C by addition of 2.5 volumes of 96% EtOH and 0.1 volume of 3M Na-acetate, pH 5.2 
(Ambion, USA), followed by centriftigation of the total RNA sample for 30 minutes at 13200 
rpm. The total RNA pellet was air-dried and redissolved in 10 ^1 of diethylpyrocarbonate 
(DEPC)-treated water (Ambion, USA) and stored at - %(fC, 
Fluorochrome'labelling of the Target 

10 The following fluorochrome-labelled cDNA targets were synthesized to test the 

performance of 'merged' probes that span exon borders. Synthetic RNAs corresponding to the 
splice variant #1 (exon01-INS3-exon03 (1-INS3-3) and splice variant #2 (exon01-INS4-exon03 
(1-INS3-3) were spiked into lO^g of C. elegans reference total RNA sample in two different 
ratios. The first target pool (KU007) contained 10 ng of splice variant #1 (1-INS3-3) transcript 

15 and 2 ng of variant #2 (1-INS4-3) transcript, a ratio of 5:1. The second target pool (KU008) 
contained 2 ng variant #1 (1-INS3-3) transcript and 10 ng of splice variant #2 (1-11^84-3) 
transcript, a ratio of 1 :5. Both mRNA pools were combined in separate labeling reactions with 5 
^Ag anchored oligo(dT2o) primer and DEPC-treated water to a final volume of 8 |Lil. The mixture 
was heated at 70^C for 10 minutes, quenched on ice for 5 minutes, followed by addition of 20 

20 units of Superasin RNase inhibitor (Ambion, USA), 1 |Lil dNTP solution (lOmM each dATP, 
dGTP, dTTP and 0.4 mM dCTP, and 3 ^1 Cy5-dCTP, Amersham Biosciensces, USA), 4 |il 5 x 
RTase buffer (Invitrogen), 2\i\ 0.1 mM DTT (Invitrogen), 400 units of Superscript II reverse 
transcriptase (Invitrogen, USA) and DEPC-treated water to 20 ^1 final volume. Background 
hybridization to merged capture probes was monitored in both hybridizations using the other 

25 fiior channel with lOjig of C. elegans reference RNA alone labeled with Cy3-dCTP, according to 
the labeling method described above for the splice variant spikes. All four cDNA syntheses were 
carried out at ATC for 2 hours, and the reaction was stopped by incubation at 70°C for 5 minutes, 
followed by incubation on ice for 5 minutes. 

Unincorporated dNTPs were removed by gel filtration using MicroSpin S-400 HR 

30 columns as described below. The column was pre-spun for 1 minute at 1500 xg in a 1 .5 ml tube, 
and the colunm was placed in a new 1 .5 ml tube. The cDNA sample was slowly to the top center 
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of the resin, spun 1500-xg for 2 minutes, and the eluate was collected. The RNA was hydrolyzed 
by adding 3 ^1 of 0.5 M NaOH, mixing, and incubating at 70 °C for 15 minutes. The samples 
were neutralized by adding 3 ^1 of 0.5 M HCl and mixing, followed by addition of 450 \il IxTE, 
pH 7.5 to the neutralized sample and transfer onto a Microcon-30 concentrator (prior to use, 500 
5 ^l IxTE was spun through the column to remove residual glycerol). The samples were 
centrifuged at 14000-xg in a microcentrifuge for 1 2 minutes. Spinning was continued until 
volume was reduced to 5 ixl. The labelled cDNA probes were eluted by inverting the Microcon- 
30 tube and spinning at 1 000-xg for 3 minutes. 

Microarray Hybridization 
10 The fluorochrome-labelled cDNA samples, respectively, were combined (the two 

different ratios separately). The following were added: 3.75 jil 20x SSC (3x SSC final, which 

was passed through a 0.22 jifilter prior to use to remove particulates) yeast tRNA (1 \i§J\x\ final) 

0.625 \i\ 1 M HEPES, pH 7.0 (25 mM final, which was passed through 0.22 jifilter prior to use to 

remove particulates) 0.75 ^il 10 % SDS (0.3 % final) and DEPC-water to 25 ^1 final volume. 

15 The labelled cDNA target samples were filtered in Millipore 0.22 \i filter spin column (Ultrafree- 
MC, Millipore, USA) according to the manufacturer's instructions, followed by incubation of the 
reaction mixture at 100 °C for 2-5 minutes. The cDNA probes were cooled at room temp for 2-5 
minutes by spinning at maximum speed in a microcentrifuge. A LifterSlip (Erie Scientific 
Company, USA) was carefiilly placed on top of the microarray spotted on Immobilizer™ 

20 MicroArray Slide, and the hybridization mixture was applied to the array from the side. An 
aliquot of 30 |llL of 3xSSC was added to both ends of the hybridization chamber, and the 
Immobilizer™ MicroArray Slide was placed in the hybridization chamber (DieTech, USA). The 
chamber was sealed watertight and incubated at 65°C for 16-18 hours submerged in a water bath. 
After hybridization, the slide was removed carefully from the hybridization chamber and washed 

25 using the following protocol. The slides were washed sequentially by plunging gently in 2 x 
SSC/0.1% SDS at room temperature until the cover slip falls off into the washing solution, then 
in Ix SSC pH 7.0 (150 mM NaCl, 15 mM Sodium Citrate) at room temperature for 1 minute, 
then in 0.2 x SSC, pH 7.0 (30 mM NaCl, 3 mM Sodium Citrate) at room temperature for 1 
minute, and finally in 0.05 x SSC (7.5 mM NaCl, 0.75 mM Sodium Citrate) for 5 seconds, 

30 followed by drying of the slides by spinning at 1 000 xg for 2 minutes. The slides were stored in 
a slide box in the dark until scanning. 
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Microarray data analysis 

The splice variant microarray was scanned in a ScanArray 4000XL confocal laser 
scanner (Packard Instruments, USA). The hybridization data were analysed using the GenePix 
Pro 4.01 microarray analysis software (Axon, USA). Only the Cy5 (650 nm) data were 
5 examined as both hybridizations produced comparable, and acceptably low, signal from the C 
elegans reference RNA alone (Cy3 channel). 
Normalization 

Data was normalized so that it could be compared between hybridizations. Both 
hybridizations contained the same amount of RNA from synthetic exons 01 and exons 03 (10+2 

10 ng), so signal from the capture probes designed to internal regions of these exons is expected to 
be equal. The ratio of raw Cy5 signal between the two different labeled cDNA target pools, 
designated as KU007 and KU008 hybridizations, for each probe corresponding to either of these 
exons was calculated, that is for each probe /.we calculated the ratio probe/KUOO?/ 
probe/KUOOS). The average of all of these ratios was used as the normalization ratio. 

1 5 Expectations of normalized data. 

To reflect the proportions of RNA spiked into the hybridization, the ratio of signal in 
hybridization KU007/KU008 should be 5 for probes designed to exon junctions of the INS3 
splice variant #1 and 0.2 for probes corresponding to 1-INS4 splice variant #2. Data was log2 
transformed: log2(5)=+2.32, log2(0.2)=-2.32. The merged probe corresponding to the exon 01- 

20 exon 03 border desirably produces a consistently low value that is desirably independent of 
which transcript was more abundant, i.e., Iog2(ratio)=0. 
Array results 

Results are sunmiarized in Table 15. 50-mer capture probes containing LNA in a block 
spanning exon-exon junctions were consistent in producing the expected ratios. 

25 
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Table 15. LNA 50-mer block probes are most consistent in producing overall data closest to 
expected ratios. 


Caoture orobe 

ExDected ratio noa2) 

Observed ratio noa2^ with meraed 
LNA block capture probes 

gene78.m0103 

0.00 

-0.24 

gene78.m01INS3 

2.32 

2.93 

gene78.m01INS4 

-2.32 

-2.39 

gene78.mlNS303 

2.32 

3.11 

gene78.mlNS403 

-2.32 

-0.86 


Example 10: Improved Signal-to-noise Ratios using LNA Oligonucleotide Capture Probes 
Combined with cDNA Target Fragmentation with the E. coli Uracil-DNA Glycosylase. 

Capture Probe Design 

The capture probes were designed to a 602 -nucleotide sequence in the 3 '-region of the 
Yeast (5. cerevisiae) 70 kDa heat shock protein (SSA4) gene. The 602-base pair sequence is 
shown in Table 16. For the LNA-spiked oligonucleotide capture probes, every third DNA 
nucleotide was substituted with a LNA nucleotide. All capture probes are shown in Table 17. 

Table 16. Six hundred and two (602) base pair sequence stretch of thCiS. cerevisiae ssa4 gene. 
The underlined segments indicate the position of the capture probes. First underline is equal to 
capture probe YER103W-554, second underline is equal to capture probe YER103W-492 and so 
forth. (SEQ ID NO: ) 

gg« gaaaggac a aggac aaaagac a ac aat? ct; ac^ SS3^ aaal;t;t? gag^t; gage gg^ a't^ ccacccgc^cc aagaggc g« ac c ac 
aaa^^ gaag^^ ac a%^^ ga^ a^ c ga-^ gc aaa« ggt; a^^ ct; gaac gt; at gc c crtt gaaaaaggt act ggt aaat ct aac aagat 
t ac aatt act aac gat aagggaagatt at c gaaggaagat at c gat aaaat ggt t get gaggc agaaaagtt c aaggc c gaagat 
gaac aagaagct c aac gt gtt c aagct aagaat c agct agaat c gt ac gc gttt acttt gaaaaatt ct gt gage gaaaat aact 
t c aaggagaaggt gggt gaagaggat gc c aggaaatt ggaagc c gc c gc c c aagat get at aaatt ggtt agat get t c gc aage 
ggc c t c c ac c gaggaat ac aaggaaaggc aaaaggaact agaaggt gtt gc aaac c c c att at gagt aaatttt ac ggagct gc a 
ggt ggt gc c c e aggage agge c c agtt c c gggt get ggagc agge e c c act ggage ac c agac aac ggc c c aac ggtt gaagagg 
tt gatt ag 
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Table 17. Capture probes for the SSA4 tile array. (SEQ ID NOs: ) 


Oligo Name 

Sequence 

YER103W-1-DNA 

gccccactggagcaccagacaacggcccaacggttgaagaggttgattag 

YER103W-38-DNA 

gccccaggagcaggcccagttccgggtgctggagcaggccccactggagc 

YER103W-73-DNA 

ccattatgagtaaattttacggagctgcaggtggtgccccaggagcaggc 

YER103W-92-DNA 

ctagaaggtgttgcaaaccccattatgagtaaattttacggagctgcagg 

YER103W-127-DNA 

cctccaccgaggaatacaaggaaaggcaaaaggaactagaaggtgttgca 

YER103W-200-DNA 

ggtgaagaggatgccaggaaattggaagccgccgcccaagatgctataaa 

YER103W-245-DNA 

actttgaaaaattctgtgagcgaaaataacttcaaggagaaggtgggtga 

YER103W-272-DNA 

aagaatcagctagaatcgtacgcgtttactttgaaaaattctgtgagcga 

YER103W-336-DNA 

aatggttgctgaggcagaaaagttcaaggccgaagatgaacaagaagctc 

YER103W-393-DNA 

taacaagattacaattactaacgataagggaagattatcgaaggaagata 

YER103W-447-DNA 

cgatgcaaatggtattctgaacgtatctgccgttgaaaaaggtactggta 

YER103W-492-DNA 

acccgctccaagaggcgtaccacaaattgaagttacatttgatatcgatg 

YER103W-554-DNA 

ggtgaaaggacaaggacaaaagacaacaatctactgggtaaatttgagtt 

YER103W-1-LNA1 

GccmCcamCtgGagmCacmCagAcaAcgGccmCaamCggTtgAagAggTtgAttAg 

YER103W-38-LNA1 

GccmCcaGgaGcaGgcmCcaGttmCcgGgtGctGgaGcaGgcmCccActGgaGc 

YER103W-73-LNA1 

mCcaTtaTgaGtaAatTttAcgGagmCtgmCagGtgGtgmCccmCagGagmCagGc 

YER103W-92-LNA1 

mCtaGaaGgtGttGcaAacmCccAttAtgAgtAaaTttTacGgaGctGcaGg 

YER103W-127-LNA1 

inCctmCcamCcgAggAatAcaAggAaaGgcAaaAggAacTagAagGtgTtgmCa 

YER103W-200-LNA1 

GgtGaaGagGatGccAggAaaTtgGaaGccGccGccmCaaGatGctAtaAa 

YER103W-245-LNA1 

ActTtgAaaAatTctGtgAgcGaaAatAacTtcAagGagAagGtgGgtGa 

YER103W-272-LNA1 

AagAatmCagmCtaGaaTcgTacGcgTttActTtgAaaAatTctGtgAgcGa 

YER103W-336-LNA1 

AatGgtTgcTgaGgcAgaAaaGttmCaaGgcmCgaAgaTgaAcaAgaAgcTc 

YER103W-393-LNA1 

TaamCaaGatTacAatTacTaamCgaTaaGggAagAttAtcGaaGgaAgaTa 

YER103W-447-LNA1 

mCgaTgcAaaTggTatTctGaamCgtAtcTgcmCgtTgaAaaAggTacTggTa 

YER103W-492-LNA1 

AccmCgcTccAagAggmCgtAccAcaAatTgaAgtTacAttTgaTatmCgaTg 

YER103W-554-LNA1 

GgtGaaAggAcaAggAcaAaaGacAacAatmCtamCtgGgtAaaTttGagTt 

Control capture probi 

es 

YFL039C-50 

acaagaatacgacgaaagtggtccatctatcgttcaccacaagtgtttct 

YFL039C-50_LNA3 

AcaAgaAtamCgamCgaAagTggTccAtcTatmCgtTcamCcamCaaGtgTttmCt 

YDR146C-50 

tgggaatggaacggggattatggtttcgccaatgaaaactaatcaaaggt 

YDR146C-50_LNA3 

TggGaaTggAacGggGatTatGgtTtcGccAatGaaAacTaaTcaAagGt 
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. Printing and Coupling of the Yeast SSA4 Tile Microarrays 

The SSA4 capture probes were synthesized with a 5' anthraquinone (AQ)-modification, 
followed by a hexaethyleneglycol-2 (HEG2) linker. The capture probes (Table 17) were first 
5 diluted to a 20 jiM final concentration in 100 mM Na-phosphate buffer pH 7.0, and spotted on 
the hnmobilizer microarray slides (Exiqon, Denmark) using the Biochip Arrayer One (Packard 
Biochip Technologies) with a spot volume of 2x 300 pi and 400 |im between the spots. The 
capture probes were immobilized onto the microarray slide by UV irradiation in a Stratalinker 
with 2300 lijoules (Stratagene, USA). Non-immobilized capture probe oligonucleotides were 
10 removed from the slides by washing the slides two times 15 minutes in IxSSC. After washing, 
the slides were dried by centrifiigation at lOOOxg for 2 minutes, and stored in a slide box until 
microarray hybridization. 

Yeast Cultures 

Saccharomyces cerevisiae wild-type (BY4741, MATa; his3Al; leu2A0; met 1 SAO; 

1 5 ura3 AO) and Assa4 (MATa; his3 Al ; leu2A0; met! SAO; ura3 AO; YERl 03w: :kanMX4) mutant 

strains (EUROSCARF) were grown in YPD at 30°C until the A6oo density of the cultures reached 
0.8. Half of the cultures were collected by centrifugation and resuspended in one volume of 
40°C preheated YPD. Incubation was continued for an additional 30 minutes at 30°C or 40°C 
for the standard and heat-shocked cultures, respectively. Cells were harvested by centrifiigation 

20 and stored at -80°C. 

RNA Extraction 

Total RNA was extracted using the FastRNA Kit- RED (BIO 101) according to suppliers' 
instructions. The quantity and quality of the RNA preparations were examined by standard 
2S spectrophotometry on a NanoDrop ND-1000 (USA) and by gel electrophoresis. Only high 
quality RNA preparations were used for microarray analyses. 
Fluorochrome-labelling of the Target 

A total of seven cDNA assay mixtures were produced; each with ten (10) |ig total RNA 
from wtand combined with 5 )ig anchored oligo(dT2o) primer and DEPC-treated water to a final 
30 volume of 8 |li1. The mixtures were heated at l(fC for 10 minutes, quenched on ice for 5 

minutes, followed by addition of 20 units of Superasin RNase inhibitor (Ambion, USA), 3 ^il 
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Cy3-dCTP (Amersham Biosciences), lOmM final concentration of dATP and dGTP, 4 ^il 5 x 
RTase buffer (Invitrogen), 2|xl 0.1 mM DTT (Invitrogen), 400 units of Superscript II reverse 
transcriptase (Invitrogen, USA), dUTP and dTTP accordingly to Table 18, and DEPC-treated 
water to 20 |il final volume. A parallel set-up was made with 10 [xg total RNA from dissa4 for 
5 target cDNA labelling with Cy5-dCTP. All cDNA syntheses were carried out at 42*^C for 2 

hours, and the reaction was stopped by incubation at 70*^0 for 5 minutes, followed by incubation 
on ice for 5 minutes. Each cDNA pool (except the unfragmented control pool) was incubated at 
ZTC for 2 hours with 2 units of Uracil-DNA Glycosylase (UDG, New England Biolabs, USA) 
and by addition of 2.4 ^1 (Ix final concentration in the reaction mixture) of UDG reaction buffer. 
10 The enzyme was heat-inactivated at 95°C for 10 minutes. Unincorporated dNTPs were removed 
by gel filtration using MicroSpin S-400 HR columns as described in Example 9. 

Table 18. dUTP and dTTP ratios in cDNA target labelling. 


Assay # 

Final conc. dUTP 

Final cone. dTTP 

1 

0.5 

0 

2 

0.25 

0.25 

3 

0.125 

0.375 

4 

0.05 

0.45 

5 

0.025 

0.475 

6 

0.0125 

0.4875 

7 

0 

0.5 


15 Gel electrophoresis of the cDNA target pools 

0.5 |il of each of the seven fragmented cDNA pools were analysed on a 2% agarose-gel. 
The data show that the cDNA is fragmented linearly with respect to the concentration of dUTP 
used in the synthesis. Fig. 29 shows the gel electrophoresis of fi-agmented cDNA from the yeast 
wild-type strain. 
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Comparative Hybridization of the SSA4 Tile Array with Fluorochrome-labelled wild-type and 
Assa4 cDNA Target Pools and Post-hybridization Washes 

The fluorochrome-labelled cDNA samples, respectively, were combined (the different 
UDG-fragmented samples separately). The following were added: 3.75 ^l 20x SSC (3x SSC 
5 final, pass through 0.22 yi filter prior to use to remove particulates) yeast tRNA (1 |ig/|il final) 
0.625 \i\ 1 M HEPES, pH 7.0 (25 mM final, pass through 0.22 ^ filter prior to use to remove 
particulates) 0.75 ^l 10 % SDS (0.3 % final) and DEPC- water to 25 ^1 final volume. The 
labelled cDNA target samples were filtered in Millipore 0.22 ^ filter spin column (Ultrafi-ee-MC, 
Millipore, USA) according to the manufacturer's instructions, followed by incubation of the 

10 reaction mixture at 100 °C for 2-5 minutes. The cDNA probes were cooled at room temp for 2-5 
minutes by spinning at maximum speed in a microcentrifuge. A LifterSlip (Erie Scientific 
Company, USA) was carefully placed on top of the SSA4 microarrays spotted on Immobilizer™ 
MicroArray Slide, and the hybridization mixture was applied to the array from the side. An 
aliquot of 30 of 3xSSC was added to both ends of the hybridization chamber, and the slide 

15 was placed in the hybridization chamber (DieTech, USA). The chamber was sealed watertight 
and incubated at 65^C for 16-18 hours submerged in a water bath. After hybridization, the slide 
was removed carefully from the hybridization chamber and washed using the following protocol. 
The slides were washed sequentially by plunging gently in 2 x SSC/0.1% SDS at room 
temperature until the cover slip falls of into the washing solution, then in Ix SSC pH 7.0 (150 

20 mM NaCl, 15 mM Sodium Citrate) at room temperature for 1 minute, then in 0.2 x SSC, pH 7.0 
(30 mM NaCl, 3 mM Sodium Citrate) at room temperature for 1 minute, and finally in 0.05 x 
SSC (7.5 mM NaCl, 0.75 mM Sodium Citrate) for 5 seconds, followed by drying of the slides by 
spinning at 1 000 xg for 2 minutes. The slides were stored in a slide box in the dark until 
scanning. 

25 Microarray Data Analysis 

The slides were scanned in a ScanArray 4000XL confocal laser scanner (Packard 
Instruments, USA). The hybridization data were analysed using the GenePix Pro 4.01 
microarray analysis software (Axon, USA). 

In the data analysis, the differences in labelling efficiency between the two fluorescent 

30 dyes were scaled by using an internal normalization approach. The average signal intensities 
from the control capture probes (Table 17) were used to calculate the normalization factor. This 
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factor was multiplied to the signal intensity values from the Cy-3 target. Analysis of the data 
demonstrates that capture probes with LNA in every third position have up to 5.2 fold higher 
signal-to-noise ratios, compared to the DNA capture probes (Fig. 30). 


5 Example 11: Interpretation of Splice Array Data Using LNA Discriminating Probes. 

This example illustrates the interpretation of microarray analysis of alternative mRNA 
splicing. Different LNA capture probe design types are formalized, and the expression constarit 
0 is introduced as a measurement of alternative splicing. 
Introduction 

10 The eukaryotic pre-mRNA is the subject of Splicing and Alternative Splicing, hence 

sequences refer to RNA sequences, Original sequence refers to pre-mRNA, and splice forms 
refer to mRNA sequences. The splicing is conducted by a cellular machinery named the 
spliceosome. The terms exons and introns can be used to refer to regions of pre-RNA sequences 
(or more specifically a single splice form). It is noted that a part of the corresponding DNA/pre- 

15 mRNA sequence that is an exon (not excised) in one splice form can potentially be absent in 
another splice form (e.g., partly absent in exon truncation and completely absent in exon 
skipping). Thus, the terms "constant regions" and "variable regions" (see below) are useful for 
characterizing the process of identifying different splice forms. 

Splicing can be defined as the production of a new sequence via the excision of part(s) of 

20 an original sequence (Fig. 31). Alternative splicing can be defined as the production of more 
than one novel sequence via the excision of different parts of the original sequence. When 
comparing two different splice forms, they can be divided into a constant region that is shared by 
both sequences and a variable region by which the two splice forms differ (Fig. 32). 

Alternative splicing can be categorized in terms of (i) whether or not the variable region 

25 is flanked by a single constant region or surrounded by two constant regions, (ii) the size of the 
variable region (e.g., exon skipping/intron retention vs. extension and truncation) [(intron/exon) 
5' and 3'], and (iii) the number of variable regions (and hence the number of splice forms). 
Capture Probe Design 

Capture Probe design can be divided into 3 distinct types according to their position: 

30 Merged Probes (MP) or Junction Probes, Unique Internal Probes (UIP), and Shared Internal 
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Probes (SIP) (Fig. 33). Considering the case of a single variable region surrounded by constant 
regions, there are several different possible capture probe positions for each type (Fig. 34). 
Data Interpretation 

The aim of the analyses can be to determine (i) whether a given original sequence is 
5 subject to alternative splicing (i.e., whether there is more than one splice form present), and (ii) 
whether there is a difference in alternative splicing of the original sequence between two 
biological samples (i.e., whether the proportions between the two splice forms differ between 
biological samples). The analysis can also be used for data validation. 

Possible biases in the microarray platform include (a) noise in terms of non-specific 
10 binding and subsequent false signal, (b) differences in dye labeling efficiency, (c) differences in 
capture probe affinity, (d) differences in sample conditions (e.g., number of cells, and amount of 
RNA), and (e) differences in reverse transcriptase efficiency of different splice forms. Biases 
can be corrected for by various means of normalization and/or standardization. 
Data Analysis 

1 5 In order to analyze the expression of the different splice forms, the expression constant 0 

is introduced. 0 denotes the relation between the proportions of the signals (capture probes a 
and b) between the labeled extracts from biological samples (labeled with Cy5 & Cy3). That is, 

(Cy5a / Cy3a) = (Cy5b / Cy3b) * 0 or, 

20 

0 = (Cy5a / Cy3a) / (Cy5b / Cy3b) 
= (Cy5a * Cy3b) / (Cy5b * Cy3a) and, 

0 = (Cy5a / Cy5b) / (Cy3a / Cy3b) 
25 = (Cy5a * Cy3b) / (Cy3a * Cy5b) 

= (Cy5a * Cy3b) / (Cy5b * Cy3a) [same as above] 

Considering normalization due to different biases and given a sample normalization 
factor S due to differences between the samples in terms of amounts of RNA, RT-efficiency, dye 
30 properties, etc. and a probe normalization factor P due to differences in probes in terms of 
affinity, position in target sequence, etc., the following equations apply. 
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For two probes: a and b, a * P = b 

For two samples Cy5 & Cy3, Cy5 * S = Cy3, 


5 Thus, considering two probes from two samples the signals are: 

- Cy5a * P * S 

- Cy5b * S 

- Cy3a * P 
-Cy3b 

10 

With respect to 0: 

0 = [(Cy5a * P * S) / (Cy3a * P)]/[ (Cy5b * S)/( Cy3b)] 
= (Cy5a * P * S * Cy3b) / (Cy3a * P * Cy5b * S) 
1 5 = (Cy5a * Cy3b * P * S) / (Cy3a * Cy5b * P * S) 

= (Cy5a * Cy3b) / (Cy3a * Cy5b) * (P * S) / (P * S) 
= (Cy5a * Cy3b) / (Cy3a * Cy5b) * 1 

= (Cy5a * Cy3b) / (Cy3a * Cy5b) [same as without normalization] 

20 Note that the calculation of 0 is not affected by the normalization factors S and P, hence it is not 
necessary to normalize the array data when interpreting alternative splice arrays with the use of 
the Expression constant 0. 

Properties of 0 

If 0 = 1, there is no difference in the proportions of the targets of capture probes a and b 
25 in the two samples. Even in the case of alternative splicing, it is not possible to determine 

whether there is more than a single splice form present using this particular method. If 0 ^ 1, 
there is a difference in the proportions of the targets of capture probes a and b in the two 
samples, thus there is a difference in splice pattern and therefore there must be more than one 
splice form present. 
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Comparing 0's 

0 can be compared between different transcripts to determine whether they have 
correlated expression, and 0's from sets of capture probes from the same transcript (different 
probes) can be averaged. 
5 Example 

Considering a simple example of a single large variable region surrounded by constant 
regions using a combination of a Merged Probe and a Shared Intemal Probe. Calculating 0 of a 
single splice form can be performed using the following equation: 

10 0 = (Cy5MP * Cy3SIP) / (Cy3MP * CySSIP) 

If 0 = 1 , there is no difference in the proportions of the targets of capture probes a and b in the 
two samples, and it may not be possible to determine whether multiple splice forms are present 
using this particular method. If 0 # 1 , there is a difference in the proportions of the targets of 
15 capture probes a and b, thus there is a difference in splice pattern and therefore there must be 
more than one splice form present. 
Conclusions 

It is possible to infer difference in expression level of two capture probe targets from two 
tissues when one is comparing the proportions of signals from one capture probe with the 
20 proportion of signals from the other probe. In contrast, single signals may be subject to biases 
from normalizations and standardizations for each probe and sample. 

Example 12: Exemplary Microarrays 

The nucleic acid arrays of the invention can be generated by standard methods for either 
25 synthesis of nucleic acid probes that are then bonded to a solid support or synthesis of the nucleic 
acid probes on a solid support (e.g., by sequential addition of nucleotides to a reactive group on 
the solid support). In desirable methods for on-chip synthesis of the capture probes, 
photogenerated acids are produced in light-irradiate sites of the chip and used to deprotect the 5*- 
OH group of nucleic acid monomers and oligomers (e.g., to remove an acid-labile protecting 
30 group such as 5'-0-DMT) to which a nucleotide is to be added (Gao et al., Nucleic Acid 

Research 29:4744-4750, 2001). Standard methods can also be used to label the nucleic acids in a 
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test sample with, e.g., a fluorescent label, incubate the labeled nucleic acid sample with the array, 
and remove any unbound or weakly bound test nucleic acids from the array. Exemplary methods 
are described, for example, in U.S.P.N. 6,410,229; 6,406,844; 6,403,957; 6,403,320; 6,403,317; 
6,346,413; 6,344,316; 6,329,143; 6,310,189; 6,309,831; 6,309,823; 6,261,776; 6,239,273; 
5 6,238,862; 6,156,501; 5,945,334; 5,919,523; 5,889,165; 5,885,837; 5,744,305; 5,445,934; 
5,800,9927; and 5,874,219. 

In an exemplary method for synthesis of an array, capture probes were 
immobilized using AQ technology with a HEG5 linker (U.S.P.N. 6,033,784) onto an 
Immobilizer™ slide. An exemplary chip consists of 288 spots in four replicates (i.e., 

10 1152 spots) with a pitch of 250 \xm, and an exemplary hybridization buffer is 5xSSCT 
(i.e., 750 mM NaCl, 75 mM Sodium Citrate, pH 7.2, 0.05% Tween) and 10 mM MgCl2. 
An exemplary target is a 45-mer oligonucleotide with Cy5 at the 5* end and with a final 
concentration in the hybridization solution of 1 p.M. 

Hybridization was performed with 200 hybridization solution in a 

1 5 hybridization chamber created by attaching a CoverWell™ gasket to the Immobilizer 
slide. The incubation was conducted overnight at 4°C. After hybridization, the 
hybridization solution was removed, and the chamber was flushed with 3x1.0 mL 
hybridization buffer described above without any target nucleic acid. A coverWell™ 
chamber was then filled with 200 |iL hybridization solution without target. The slide was 

20 observed with a Zeiss Axioplan 2 epifluorescence microscope with a 5x Fluar objective 
and a Cy5 filterset from OMEGA. The temperature of the microscope stage was 
controlled with a Peltier element. Thirty-five images at each temperature were acquired 
automatically with a Photometries camera, automated shutter, and motorized microscope 
stage. The images were acquired, stitched together, calibrated and stored in stack by the 

25 software package "MetaVue" 

Arrays can be generated using capture probes of any desired length (e.g., arrays of 
pentamers, hexamers, or heptamers.) In various embodiments, 1, 2, 3, 4, 5, 6, 7, 8, or more 
nucleotides of the probes are LNA nucleotides. Desirably, at least 1,2,3, 5, 7, 9, or all of the A 
and T nucleotides in the probes are LNA A and LNA T nucleotides. LNA nucleotides can be 

30 placed in any position of the capture probe, such as at the 5' terminus, between the 5' and 3' 

termini, or at the 3' terminus. LNA nucleotides may be consecutive or may be separated by one 
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or more other nucleotides. The microarrays can be used to analyze target nucleic acids of any 
"AT" or "GC" content, and are especially useful for analyzing nucleic acids with high "AT" 
content because of the increased affinity of the microarrays of the present invention for such 
nucleic acids compared to traditional microarrays. Desirably, the array has at least 100, 200, 
5 300, 400, 500, 600, 800, 1000, 2000, 5000, 8000, 10000, 15000, 20000, or more different probes. 
If desired, nucleotides with a universal base can be included in the capture probes to increase the 
Tm of the capture probes (e.g., capture probes of less than 7, 6, 5, or 4 nucleotides). Exemplary 
"non-discriminatory" nucleotides include inosine, random nucleotides, 5 nitro-indole, LNA, 
inosine, and LNA 2-aminopurine. In desirable embodiments, 1, 2, 3, 4, 5, or more nucleotides 
10 with a universal base are located at the 5' and/or 3' termini of the capture probes. 

Example 13: Exemplary Application of Nucleic Acids of the Invention 

An exemplary application of these methods includes comparing hybridization 
pattems of cDNA or cRNA from a patient sample to classify early-tumors or detect an 

1 5 infection or a diseased state. The microarrays of the invention may also be used as a 

general tool to analyze the PGR products generated by amplification of a test sample with 
PGR primers for one or more nucleic acids of interest. For example, PGR primers can be 
used to amplify nucleic acids with a particular exon or exon-exon combination, and then 
the PGR products can be identified and/or quantified using a microarray of the invention. 

20 For identification of splice variants, PGR primers to specific exons can be used to 
amplify nucleic acids that are then applied to a microarray for detection and/or 
quantification as described herein. To detect microbial pathogens, species-specific PGR 
primers (e.g., primers specific for an exon whose sequence differs among species) can be 
used to amplify nucleic acids in a sample for subsequent analysis using a microarray. For 

25 example, the hybridization pattern of the PGR products to the array can be used to 
distinguish between different bacteria, viruses, or yeast and even between different 
strains of the same pathogenic species. In particular embodiments, the array is used to 
determine whether a patient sample contains a bacteria strain that is known to be resistant 
or susceptible to particular antibiotics or contains a virus or yeast strain known to be 

30 resistant or susceptible to certain drugs. Ghanges in product composition or raw material 
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origin can also be detected using a microarray. The arrays can also be used to determine 
the composition of mRNA cocktails. 

Exemplary environmental microbiology applications of these arrays include 
identification of major rRNA types in contaminated soil samples and classification of 
5 microbial isolates. These rRNA amplificates are formed from rRNA by rtPCR or from 
the rDNA gene by conventional PGR. Numerous general and selective primers for 
different groups of organisms have been published. Most frequently an almost ftiU length 
amplificate of the 16S rDNA gene is used (e.g., the primers 26F and 1492R). For 
purifying rRNA from a soil sample, standard methods such as one or more commercial 
10 extraction kits from companies such as QIAGEN ("Rneasy", Q-biogene "RNA PLUS," 
or "Total RNA safe" can be used. 

Example 14: Methods for Minimizing the Variance in Melting Temperatures in Nucleic 
Acid Populations of the Invention 

1 5 Any simultaneous use of more than one primer or probe is made difficult because the 

involved primers or probes must w^ork under the same conditions. An indication of whether or 
not two or more primers or probes will work under the same conditions is the relative Tms at 
which the hybridized oligonucleotides dissociate. In cases where probes are applied for specific 
detection of homologous sequences such as splice variants, the ATn, is of importance. ATm 

20 expresses the difference between Tm of the match and the Tm of the mismatch hybridizations. 
Generally, the larger ATm obtained, the more specific detection of the sequence of interest. In 
addition, a large ATm facilitates more probes to be used simultaneously and in this way a higher 
degree of multiplexity can be applied. 

High affinity nucleotide analogs such a LNA can be also be used universally to equalize 

25 the melting properties of oligonucleotides with different AT and CG content. The increased 

affinity of LNA adenosine and LNA thymidine corresponds approximately to the normal affinity 
of DNA guanine and DNA cytosine. An overall substitution of all DNA-A and DNA-T with 
LNA-A and LNA-T results in melting properties that are nearly sequence independent but only 
depend on the length of the oligonucleotide. This may be important for design of 

30 oligonucleotide probes used in large multiplex analysis. The effect of LNA A and T 
substitutions has been evaluated by predicting the Tm value of all possible 9-mer 
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oligonucleotides with different universal substitutions. The distribution of the 262,000 Tm-values 
exhibits a very homogeneous Tm value for universally LNA A and T substituted 
oligonucleotides. The standard deviation of the melting temperature for all 9-mers drops from 
7.7°C for pure DNA to only 2.2°C for LNA A and T substituted oligonucleotides. This 

5 equalizing effect may also be utilized for photomediated on-chip synthesis of oligonucleotides. 
It is often difficult to design probes and primers with the same range of melting 
temperature due to the variance in A/T and G/C content of the probing sites. Highly A/T rich 
regions typically give lower Tm values. Furthermore, if single mismatches are to be resolved, 
G/T mismatches are known to contribute little to ATm. As discussed above, the use of LNA is a 

10 desirable way to solve problems related to multiplex use of primers and probes. LNA offers the 
possibility to adjust Tm and increase the ATm at the same time. LNA increases Tm with 4- 
8°C/substitution and increases ATm in many cases (Table 9). 

Table 9. Demonstration of LNA controlled increase of Tm and A Tm. (SEQ ID NOs: ) 

15 


Tm of LNA:DNA 
Duplexes 

Perfect 
match 
3'-ACGACCAC-5' 

Single 
mismatch 
3'-ACGGCCAC-5' 

AT^ 
m 

LNA 8-mer 
5'-TGCTGGTG-3' 

71"C 

45"C 

26"C 

DNA 8-mer 
5'-TGCTGGTG-3* 

35"C 

25"C 

10*^0 


As LNA can be mixed with DNA during standard oligonucleotide synthesis, LNA can be 
placed at optimal positions in probes in order to adjust Tm. The specificity of PGR may also be 
enhanced by the use of LNA in primers, or probes, and this facilitates a higher degree of 
25 multiplexity. By incorporation of LNA, the Tm of the primers or probes can be adjusted to work 
at the same temperature. Amplification or hybridization is more specific when LNA is included 
in primers or probes. This is due to the LNA increased ATm, which relates to higher specificity. 
Once ATm of the primers or probes is high, more primers or probes can potentially be brought to 
work together. 
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Prediction ofTm 

LNA can be used to enhance any experiment that is based on hybridization. The series of 
algorithms described herein have been developed to predict the optimal use of LNA. Melting 
properties of 129 different LNA substituted capture probes hybridized described herein to their 
5 corresponding DNA targets were measured in solution using UV-spectrophotometry. The data 
set was divided into a training set with 90 oligonucleotides and a test set with 39 
oligonucleotides. The training set was used for training of both linear regression models and 
neural networks. Neural networks trained with nearest neighbour information, length, and 
DNA/LNA neighbour effect are efficient for prediction of Tm with the given set of data. 

10 Applications of the Normalization of Thermal Stability by LNA A and T Nucleotide Substitutions 
All assays in which DNA/RNA hybridization is conducted may benefit from the use of 
LNA in terms of increased specificity and quality. Exemplary uses include sequencing, primer 
extension assays, PGR amplification, such as multiplex PGR, allele specific PR amplification, 
molecular beacons, (e.g., nucleic acids be multiplexed with one colour based on multiple Tm's), 

15 Taq-man probes, in situ hybridization probes (e.g., chromosomal and bacterial 16S rRNA 
probes), capture probes to the mRNA poly-A tail, capture probes for microarray detection of 
SNPs, capture probes for expression microarrays (sensitivity increased 5-8 times), and capture 
probes for assessment of alternative mRNA splicing. 

20 Example 15: Exemplary Methods for the Prediction of Melting Temperatures for Nucleic 
Acid Populations of the Invention 

LNA units have different melting properties than DNA and RNA nucleotides. Until 
recently, thermodynamical models for melting temperature prediction have existed for DNA and 
RNA only, but not for LNA. Now a Tm prediction model for LNA/DNA mixed oligonucleotides 

25 has been developed (John SantaLucia, Jr. (1998) Proc. Natl. Acad. Sci. 95 1460 - 1465 and 
Tostesen et al. "Prediction of Melting Temperature for LNA (Locked Nucleic Acid) Modified 
Oligonucleotides" Bioinformatics 2002, Bergen, Norway April 4-7 2002). The Tm prediction 
tool is available on-line at the Exiqon website (www.LNA-Tm.com and 
http://www.exiqon.com/Poster/Tmpred-ET-view.pdf). 

30 Numerous applications in molecular biology are based on the ability of DNA and RNA to 

hybridize in a temperature dependent manner (e.g. the microarray techniques, PGR reactions and 
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blotting techniques). The meUing properties of nucleic acid duplexes, in particular the melting 
temperature Tm, are crucial for optimal design of such experiments. Tm is usually computed 
using a two-state thermodynamical model (Breslauer, Meth. Enzymol., 259:221-242, 1995). 
Several different groups have estimated model parameters for nearest neighbors in the sequence 
5 based on experimental data (for a review see SantaLucia, Proc. Natl. Acad. Sci., 95:1460-1465, 
1998). 

The model described herein predicts the Tm of duplexes of mixed LNA/DNA 
oligonucleotides hybridized to their complementary DNA strands. DNA monomers are denoted 
with lowercase letters, and LNA monomers are denoted with uppercase letters, e.g., there are 
10 eight types of monomers in the mixed strand: a, c, g, t, A, C, G and T. The model is based on the 
formula (SantaLucia, 1998, supra\ Allawi et al. Biochemistry 36:10581-10594, 1997). 

T ^ ^ ^ 

" A5 + • ln(C - 1 2)+ 0.368(i - l)ln[Na^ J' 

15 in which the salt concentration [Na+J enters as an entropic correction together with the 

oligonucleotide concentrations. R is the gas constant, C and are the concentrations of the 

two strands where C > , and L is the length of the strands. For self-complementary 

sequences, C-C^I2 is replaced by the total strand concentration Cj and a symmetry 

correction of -1 .4 cal/k mol is added to AS (SantaLucia, 1998, supra), 
20 The LNA model differs from SantaLucia's DNA model in the way the changes in 

enthalpy AH and entropy AS are calculated. As in SantaLucia's model, they depend on nearest 
neighbor sequence information and special contributions for the terminal base-pairs in the two 
ends of the duplex. However, with eight types of monomers (LNA and DNA) the increased 
number of nearest neighbor combinations requires more model parameters to be determined and 
25 hence more data. 

Parameter Reduction 

Usually AH and AS are calculated as a sum of contributions from all nearest neighbor 
pairs in the sequence. The inclusion of LNA doubles the number of monomer types and 
quadruples the number of possible nearest neighbor pairs. Parameter reduction strategies are 
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used for matching the model complexity to limited data sets. A strategy for reducing model 
complexity is to sum AH from single base-pair contributions, which do not take the influence of 
adjacent nucleotides into account. However, nearest neighbor contributions are added as a 
correction term to the single base-pair contributions. 

Another strategy is to use hierarchically reduced monomer alphabets. Here, similar 
monomers are identified with the same letter. A four-letter alphabet, {w,s,W,S}, defines classes 
according to binding strength: w={a,t}, s={c,g}, W={A,T} and S={C,G}. The smallest alphabet, 
{D,L}, simply identifies the monomer type: DNA or LNA. As an example, the sequence 
GcTAAcTt can be written as SsWWWsWw or as LDLLLDLD. 

The principle is to split AH and AS into contributions that depend on different levels of 
detail of the sequence. The fine levels of detail require many parameters to be determined, while 
the coarse levels need fewer parameters. The more detailed contributions can then be treated as 
minor corrections, thus effectively reducing the total number of model parameters. 
Training 

Model parameters were determined using data from melting experiments on hundreds of 
oligonucleotides. The oligonucleotides were random sequences with lengths between 8 and 20 
and a percentage of LNA between 20 and 70. Melting curves were obtained using a Perkin- 
Elmer UV ^-40 spectrophotometer, but only the Tm values were used for modeling. Model 
parameters were adjusted using a gradient descent algorithm that minimizes the error fiinction 


i.e., the distance between predicted and experimental Tm values. Many different models were 
trained in this way and their performance was evaluated on test sets distinct from the training 
data. Seven reliable models were chosen and combined to form the committee model 
implemented at the Exiqon website (wMw.LNA-Tm.com.) 
Machine Learning And Thermodynamics 

The aim of this work has been to estimate Tm values as accurately as possible. To this 
end, a machine learning approach has been adopted in which the prediction of the physical AH 
and AS quantities is less important. The parameters of this model may be inaccurate as 
thermodynamic quantities. First, the gradient descent algorithm produces a broad ensemble of 



data 
set 
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models in which the AH and AS parameters can vary substantially, while maintaining an 
accuracy in the predicted Tm. Second, the thermodynamic meaning of AH and AS is based on a 
two-state assumption, which may not be realistic in every case. Even short oligonucleotides can 
form different secondary structures or melt through multiple-state transitions (Tostesen et aL, J. 
5 Phys. Chem. B, 1 05 : 1 6 1 8- 1 630, 200 1 ). Third, the use of an optical instrument instead of a 

calorimetric instrument (DSC) introduces an error in the measured AH and AS. Nevertheless, the 
uncertain thermodynamic interpretation of the AH and AS model parameters does not imply that 
the Tm prediction model is unreliable. 
Results 

10 The Tm prediction model has been tested on two data sets that were not used during the 

training process. One set consisted of pure DNA oligonucleotides without LNA monomers and 
had a standard deviation of the residuals (SEP) of 1 .57 degrees. The other set consisted of mixed 
oligonucleotides with both LNA and DNA and had a SEP of 5.25 degrees. The difference in 
prediction accuracy between the two types of oligonucleotides suggests that Tm prediction of 

15 mixed strands is a more complex task than Tm prediction of pure DNA. This is possibly due to 
irregularities in the duplex helical structure induced by the LNA monomers (Nielsen et al, 
Bioconjug. Chem. 1 1 :228-238, 2000). The obtained prediction accuracy is in both cases adequate 
for most biological applications. In conclusion, the reduced nearest neighbor model implemented 
at the Exiqon website (www . LN A-Tm . com^ can predict Tm surprisingly well for both types of 

20 oligonucleotides. This indicates that the parameter reduction strategy is applicable for other types 
of modified oligonucleotides. 

Example 16A: Algorithm to Optimize the Substitution Pattern of Nucleic Acids of the 
Invention 

25 High affinity nucleotides such as LNA and other nucleotides that are conformationally 

restricted to prefer the C3'-endo conformation or nucleotides with a modified backbone and/or 
nucleobase stabilize a double helix configuration. As these effects are generally additive, the 
most stable duplex between a high affinity capture oligonucleotide and an unmodified target 
oligonucleotide should generally arise when all nucleotides in the capture probe or primer are 

30 replaced by their high affinity analogue. The most stable duplex should thus be formed between 
a fully modified LNA capture probe and the corresponding DNA/RNA target molecule. Such a 
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fully modified capture probe should be more efficient in capturing target molecules, and the 
resulting duplex is more thermally stable. 

However, many high affinity nucleotides (e.g., as LNA) have an even higher affinity for 
other high affinity nucleotides (e.g., as LNA) than for DNA/RNA. A ftilly modified capture 
5 probe may thus form duplexes with itself, or if it is long enough, internal hairpins that are even 
more stable than duplexes with the desired target molecule. Probes with even a small inverse 
repeat segment where all constituent positions are substituted with high affinity nucleotides may 
bind to itself and be unable to bind the target. Thus, a sequence dependent substitution pattern is 
desirably used to avoid substitutions in positions that may form self-complementary base-pairs. 

10 For example, a computer algorithm can be used to automatically determine the optimal 

substitution pattern for any given capture probe sequence according to the following two criteria. 
First, the difference between the stability of (i) the duplex formed between the capture probe and 
the target molecule and (ii) the best possible duplex between two capture probes should be above 
a certain threshold. If this is not possible, then the substitution pattern with the largest possible 

15 difference is chosen. Second, the capture probe should contain as many substitutions as possible 
in order to bind as much target as possible at any given temperature and to increase the thermal 
stability of the formed duplex. Alternatively, the second criterion is substituted with the 
following alternative criterion to obtain capture probes with similar thermal stability. The 
number and position of capture probe substitutions should be adjusted so that all the duplexes 

20 between capture probes and targets have a similar thermal stability (i.e., Tm equalization). 

For oligonucleotide capture probes such, incomplete matches between target and capture 
probe are likely to be a reproducible feature of the recorded biosignatures. For short probes, the 
second criterion for increasing thermal stability is more desirable that the alternative second 
criterion for Tm equalization. For long capture probes and PCR primers, the second alternative 

25 criterion is desirably used since Tm equalization is desirable for these probes and primers. 

An exemplary algorithm works as follows. For each nucleotide sequence in an array of 
length n, all possible substitution patterns, i.e., 2" different sequences are evaluated. Each 
evaluation consist of estimating the energetic stability of the duplex between the substituted 
capture sequence and a perfect match unmodified target ("target duplex") and the energetic 

30 stability of the most stable duplex that can be formed between two substituted capture probes 
themselves ("self duplex"). 
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The energetic stability estimate for a duplex may be calculated, e.g., using a Smith- 
Waterman algorithm with the following scoring matrix. 
Gap initiation penalty: -8 
Gap continuation penalty: -50 


10 



a 

C 

g 

t 

A 

C 

G 

T 

a 

-2 








c 

-2 

-2 







g 

-2 

3 

-2 






t 

2 

-2 

1 

-2 





A 

-3 

-3 

-3 

4 

-3 




C 

-3 

-3 

6 

-3 

-3 

-3 
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-3 

6 

-3 

2 

-3 

9 

-3 


T 

4 

-3 

2 

-3 

6 

-3 

3 

-3 


1 5 This scoring matrix was partly based on the best parameter fit to a large (over 1 000) number of 
melting curves of different DNA and LNA containing duplexes and partly by visual scoring of 
test capture probe efficiency. If desired, this scoring matrix may be optimized by optimizing the 
parameter fit as well as increasing or optimizing the dataset used to obtain these parameters. 

As an example of these calculations, the heptamer sequence ATGCAGA in which each 

20 position can be either an LNA or a DNA nucleotide is used. The target duplex formed between a 
fully modified capture probes with this sequence and its unmodified target receive a score of 34 
as illustrated below. 


Capture sequence : 

25 

Target sequence : 
Score : 


A-T-G-C-A-G-A 

I I I I I I I 
t-a-c-g-t-c-t 

4+4+6+6+4+6+4 = 34 


The most stable self duplex that can be formed between two modified capture probes has an 
30 almost equivalent energetic stability with a score of 30 as illustrated below. 
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Capture sequence: A-T-G-C-A-G-A 

I I I I 

Target sequence: A-G-A-C-G-T-A 
Score: +6+9+9+6 = 30 

5 

Thus, the capture probe efficiency of a fully modified probe is likely reduced by its propensity to 
form a stable duplex with itself. In contrast, by choosing a slightly different substitution pattern, 
ATGcaGA in which capital letters represent LNA nucleotides, the stability of the target duplex is 
reduced slightly from 34 to 29. 

10 

Capture sequence: A-T-G-c-a-G-A 

I I I M I I 

Target sequence: t-a-c-g-t-c-t 
Score: 4+4+6+3+2+6+4 = 29 

15 

However, the most stable self complementary duplex that can be formed is reduced much more 
from 30 to 20, as illustrated below. 


Capture sequence: A-T-G-c-a-G-A 

INI 

Target sequence: A-G-a-c-G-T-A 
Score: +4+6+6+4 = 20 


The difference between the stability of the desired target duplex and the undesired self 
25 duplex can be further increased by using the capture sequence AtgcaGA where the target duplex 
has a score of 24. 


Capture sequence: A- t-g-c-a-G-A 

I I I M M 

30 Target sequence: t-a-c-g-t-c-t 

Score: 4+2+3+3+2+6+4 = 24 
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Whereas the score of the self duplex is only 10, as shown below. 

Capture sequence: A-t-g-c-a-G-A 

5 I I I I 

Target sequence: A-G-a-c-g-t-A 
Score: +2+3+3+2 = 10 

The additional destabilization of the self duplex is generally not required if the difference in 
1 0 stability between the target duplex and self duplex is above a threshold of 25% of the target 
duplex stability, as illustrated below. 


Discrimination 
(25%) 

15 Discrimination 
(25%) 

Discrimination 
(25%) 


for ATGCAGA = 
for ATGcaGA = 
for ATGCAGA = 


(34-30)/34 = 
(29-20)/29 = 
(24-10)/24 = 


12% < threshold 
31% > threshold 
58% > threshold 


20 Thus, ATCcaGA is the substitution pattern with the highest degree of substitution for which the 
stability of the target duplex is adequately more stable than the stability of the best self duplex 
(e.g., above 25%). 

This algorithm can be used to determine desirable substitution patterns for any size 
capture probe or any given probe sequence. The following simple design rules may also be 
25 applied for probe design, especially for short probes. The best self alignment for the 
corresponding DNA capture probe in the sequence is determined using a simple Smith- 
Waterman scoring matrix of: 


a c g t 
30 a -2 

c -2 -2 
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g -2 3 -2 

t 2-2 1-2 


Additionally, all possible positions in the sequence are substituted, with the exception of 
5 desirably avoiding the substitution of both bases of a self-complementary base -pair. The most 
stable self duplex thus does not contain any LNA:LNA base-pairs but only LNA:DNA basepairs. 

Example 16B: Computer code for a preferred software program of the invention. 

Exemplary programs are provided in the Computer Program Listing Appendix submitted 
10 on compact disc herewith. The oligod program (50287.007002_oligod.txt) takes a gene 

sequence as input and returns sequences for LNA spiked oligonucleotides. The dyp program 
(50287.007002_dyp.txt) is used by oligod to predict the secondary structure and self annealing 
properties of the oligonucleotides. The expression_array_param file 

(50287.007002 expression_array_param.txt) contains parameters used by the oligod program. 
15 50287.007002_tmprediction.txt contains code for a Tm prediction program, and 

50287.007002_tmthermodynamic.txt contains code for a Tm thermodynamic model. 

Exemplary Computer 

Any of the methods described herein may be implemented using virtually any computer, 
20 Fig. 35 shows such an exemplary computer system. Computer system 2 includes internal and 
external components. The internal components include a processor 4 coupled to a memory 6. 
The external components include a mass-storage device 8, e.g., a hard disk drive, user input 
devices 10, e.g., a keyboard and a mouse, a display 12, e.g., a monitor, and usually, a network 
link 14 capable of connecting the computer system to other computers to allow sharing of data 
25 and processing tasks. Programs are loaded into the memory 6 of this system 2 during operation. 
These programs include an operating system 16, e.g., Microsoft Windows, which manages the 
computer system, software 18 that encodes common languages and fiinctions to assist programs 
that implement the methods of this invention, and software 20 that encodes the methods of the 
invention in a procedural language or symbolic package. Languages that can be used to program 
30 the methods include, without limitation. Visual C/C^ fi-om Microsoft. In preferred applications, 
the methods of the invention are programmed in mathematical software packages that allow 
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symbolic entry of equations and high-level specification of processing, including algorithms used 
in the execution of the programs, thereby freeing a user of the need to program procedurally 
individual equations or algorithms. An exemplary mathematical software package useful for this 
purpose is Matlab from Mathworks (Natick, MA). Using the Matlab software, one can also 
5 apply the Parallel Virtual Machine (PVM) module and Message Passing Interface (MPI), which 
supports processing on multiple processors. This implementation of PVM and MPI with the 
methods herein is accomplished using methods known in the art. Alternatively, the software or a 
portion thereof is encoded in dedicated circuitry by methods known in the art. 

10 Example 17.: Exemplary Locked Nucleic Acids (LNA) 

As disclosed in WO 99/14226, LNA are DNA analogues that form DNA- or RNA- 
heteroduplexes with exceptionally high thermal stability. LNA units include bicyclic compounds 
as shown immediately below where ENA refers to 2'0,4'C-ethylene-bridged nucleic acids: 



o=p — o- 

\ 

ENA 

15 

References herein to Locked Nucleoside Analogues, LNA units, LNA monomers, or 
similar terms are inclusive of such compounds as disclosed in WO 99/14226, WO 00/56746, WO 
00/56748, and WO 00/66604. 
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Desirable LNA monomers and oligomers share some chemical properties of DNA and 
RNA; they are water soluble, can be separated by agarose gel electrophoresis, and can be ethanol 
precipitated. 

Desirable LNA monomers and oligonucleotide units include nucleoside units having a 2'- 
5 4' cyclic linkage, as described in the International Patent Application WO 99/14226 and WO 
0056746, WO 0056748, and WO 0066604. Desirable LNA monomers structures are 
exemplified in the formulae la and lb below. In formula la the configuration of the fiiranose is 
denoted D - p, and in formula lb the configuration is denoted L - a. Configurations which are 
composed of mixtures of the two, e.g. D - (3 and L - a, are also included. 


In la and lb, X is oxygen, sulfur and carbon; B is a universal or modified base 
(particularly non-natural occurring base) e.g. pyrene and pyridyloxazole derivatives, pyrenyl, 
pyrenylmethylglycerol moieties, all of which may be optionally substituted. Other desirable 
universal bases include, pyrrole, diazole or triazole moieties, all of which may be optionally 

15 substituted, and other groups e.g. modified adenine, cytosine, 5-methylcytosine, isocytosine, 
pseudoisocytosine, guanine, thymine, uracil, 5-bromouracil, 5-propynyluracil, 5-propyny-6- 
fluoroluracil, 5-methylthiazoleuracil, 6-aminopurine, 2-aminopurine, inosine, diaminopurine, 7- 
propyne-7-deazaadenine, 7-propyne-7-deazaguanine. R\ or or R"^' , R^ and R^' are 

hydrogen, methyl, ethyl, propyl, propynyl, aminoalkyl, methoxy, propoxy, methoxy-ethoxy, 

20 fluoro, or chloro. 

P designates the radical position for an intemucleoside linkage to a succeeding monomer, 
or a 5 -terminal group, R^ or R^ is an intemucleoside linkage to a preceding monomer, or a 3 - 
terminal group. The intemucleotide linkage may be a phosphate, phosphorothioate. 


10 



142 


phosphorodithioate, phosphoramidate, phosphoroselenoate, phosphorodiselenoate, 
alkylphosphotriester, or methyl phosphonate. The intemucleotide linkage may also contain non- 
phosphorous linkers, hydroxylamine derivatives (e.g. -CH2-NCH3-O-CH2-), hydrazine 
derivatives, e.g. -CH2-NCH3-NCH3-CH2, amid derivatives, e.g. -CH2- CO-NH-CH2-, CH2-NH- 
5 CO-CH2-. In la, R^^* and R^' together designate -CH2-O-, -CH2-S-, -CH2-NH-,-CH2-NMe-, -CH2- 
CH2-O-, -CH2-CH2-S-, -CH2-CH2-NH-, or -CH2-CH2-NMe- where the oxygen, sulfur or 
nitrogen, respectively, is attached to the 2 -position. In Formula lb, R"* and R^ together designate 
-CH2-O-, -CH2-S-, -CH2-NH-, -CH2-NMe-, -CH2-CH2-O-, -CH2-CH2-S-, -CH2-CH2-NH-, or - 
CH2-CH2-NMe- where the oxygen, sulphur or nitrogen, respectively, is attached to the 2-position 

10 (R^ configuration). 

Desirable LNA monomer structures are structures in which X is oxygen (Formula la and 
lb); B is a universal base such as pyrene; R^ R^ or R^ , R^ or R^ , R^ and R^' are hydrogen; P is a 
phosphate, phosphorothioate, phosphorodithioate, phosphoramidate, and methyl phosphomates; 
or is an intemucleoside linkage to a preceding monomer, or a 3'-terminal group. In 

15 Formula la, R"*' and R^' together designate -CH2-O-, -CH2-S-, -CH2-NH-, -CH2-NMe-, -CH2- 
CH2-O-, -CH2-CH2-S-, -CH2-CH2-NH-, or -CH2-CH2-NMe- where the oxygen, sulphur or 
nitrogen, respectively, is attached to the 2'-position, and in Formula lb, R"^ and R^ together 
designate -CH2-O-, -CH2-S-, -CH2-NH-,-CH2-NMe-, -CH2-CH2-O-, -CH2-CH2-S-, -CH2-CH2- 
NH-, or -CH2-CH2-NMe- where the oxygen, sulphur or nitrogen, respectively, is attached to the 

20 2-position in the R^ configuration. 

Particularly desirable LNA monomer for incorporation into an oligonucleotide of the 
invention include those of the following formula Ila 


wherein X oxygen, sulfiir, nitrogen, substituted nitrogen, carbon and substituted carbon, and 
desirably is oxygen; B is a modified base as discussed above e.g. an optionally substituted 


25 
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carbocyclic aryl such as optionally substituted pyrene or optionally substituted 
pyrenylmethylglycerol, or an optionally substituted heteroalicylic or optionally substituted 
heteroaromatic such as optionally substituted pyridyloxazole. Other desirable universal bases 

1 * 2 

include, pyrrole, diazole or triazole moieties, all of which may be optionally substituted; R , R , 
5 R^, R^ and R^* are hydrogen; P designates the radical position for an intemucleoside linkage to a 
succeeding monomer, or a 5'-terminal group, R^* is an intemucleoside linkage to a preceding 
monomer, or a 3 -terminal group; and R^* and R"** together designate -O-CH2- or -CH2-CH2-O- 
where the oxygen is attached in the 2'-position, or a linkage of -(CH2)n- where n is 2, 3 or 4, 
desirably 2, or a linkage of -S-CH2- or -NH-CH2-. 

10 LNA units of formula Ila where R^* and R"** contain oxygen are sometimes referred to 

herein as "oxy-LNA"; units of formula Ila where R^* and R"** contain sulfur are sometimes 
referred to herein as "thio-LNA"; and units of formula Ila where R^* and R"** contain nitrogen are 
sometimes referred to herein as "amino-LNA". For many applications, oxy-LNA units are 
desirable modified nucleic acid units of oligonucleotides of the invention. 

15 Particularly desirable LNA monomers for use in oligonucleotides of the invention are 2 - 

deoxyribonucleotides, ribonucleotides, and analogues thereof that are modified at the 2Vposition 
in the ribose, such as 2'-0-methyl, 2'-fluoro, 2'-trifluoromethyl, 2 '-0-(2-methoxyethyl), 2'-0- 
aminopropyl, 2'-0-dimethylamino-oxyethyl, 2'-0-fluoroethyl or 2'-0-propenyl, and analogues 
wherein the modification involves both the 2 'and 3' position, desirably such analogues wherein 

20 the modifications links the T- and 3'-position in the ribose, such as those described in Nielsen et 
al., J. Chem. Soc, Perkin Trans. 1, 1997, 3423-33, and in WO 99/14226, and analogues wherein 
the modification involves both the 2'- and 4*-position, desirably such analogues wherein the 
modifications links the T- and 4 -position in the ribose, such as analogues having a -CH2-S- or a 
-CH2-NH- or a -CH2-NMe- bridge (see Singh et al. J. Org. Chem. 1998, 6, 6078-9). Although 

25 LNA monomers having the p-D-ribo configuration are often the most applicable, other 

configurations also are suitable for purposes of the invention. Of particular use are a-L-ribo, the 
p-D-xylo and the a-L-xylo configurations (see Beier et al.. Science, 1999, 283, 699 and 
Eschenmoser, Science, 1999, 284, 2118), in particular those having a 2*-4' -CH2-S-, -CH2-NH-, - 
CH2-O- or -CH2-NMe- bridge. 

30 In another desirable embodiment, LNA modified oligonucleotides used in this invention 

comprises oligonucleotides containing at least one LNA monomeric unit of the general scheme 
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A above, wherein X, B, P are defined as above. One of the substituents R^, R^*, R"^, and R^* is a 
group P* which designates an intemucleoside linkage to a preceding monomer, or a 273'- 
terminal group. Two of the substituents of R^*, R^ R^*, R^ R^*, R^ R^*, R^ R^*, R^ and R^* 
when taken together designate a biradical structure selected from -(CR*R*)r-M-(CR*R*)s-, 

5 -(CR*R*)rM-(CR*R*)s-M-, -M-(CR*R*)rfs-M-, .M-(CR*R*)rM-(CR*R*)s-, -(CR*R*W, -M-, -M- 
wherein each M is independently selected from -0-, -S-, -Si(R*)2-, -N(R*)-, >C=0, -C(=0)- 
N(R*)-, and -N(R*)-C(=0)-. Each R*and R'^'*^-R^^^*\ which are not involved in the biradical, are 
independently selected from hydrogen, halogen, azido, cyano, nitro, hydroxy, mercapto, amino, 
mono- or di(Ci-6-alkyl)amino, optionally substituted Ci.6-alkoxy, optionally substituted Ci.6- 

10 alkyl, DNA intercalators, photochemically active groups, thermochemically active groups, 
chelating groups, reporter groups, and ligands, and/or two adjacent (non-geminal) R* may 
together designate a double bond, and each of r and s is 0-4 with the proviso that the sum r+s is 
1-5. 

Examples of LNA units are shown scheme B: 

15 
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Scheme B 

wherein the groups, X and B are defined as above, P designates the radical position for an 
intemucleoside linkage to a succeeding monomer, nucleoside such as an L-nucleoside, or a 5'- 
terminal group, such intemucleoside linkage or 5 '-terminal group optionally including the 
5 substituent R^. One of the substituents R^, R^*, R^, and R^* is a group P* which designates an 
intemucleoside linkage to a preceding monomer, or a 2 73 '-terminal group. 

Desirable nucleosides are L-nucleosides such as for example, derived dinucleoside 
monophosphates. The nucleoside can be comprised of either a beta-D, a beta-L or an alpha-L 
nucleoside. Desirable nucleosides may be linked as dimers wherein at least one of the 

10 nucleosides is a beta-L or alpha-L. B may also designate the pyrimidine bases cytosine, 5- 

methyl-cytosine, thymine, uracil, or 5-fluorouridine (5-FUdR) other 5-halo compounds, or the 
purine bases, adenosine, guanosine or inosine. 

As discussed above, a variety of LNA units may be employed in the monomers and 
oligomers of the invention including bicyclic and tricyclic DNA or RNA having a 2*-4' or 2-3' 

15 sugar linkages; 2 -0,4'-C-methylene-P-D-ribofuranosyl moiety, known to adopt a locked C3*- 
endo RNA-like furanose conformation. Illustrative modified structures that may be included in 
oligonucleotides of the invention are shown in Figure 1 . Other nucleic acid units that may be 
included in an oligonucleotide of the invention may comprise 2'-deoxy-2 -fluoro ribonucleotides; 
2'-0-methyl ribonucleotides; 2 -O-methoxyethyl ribonucleotides; peptide nucleic acids; 5- 

20 propynyl pyrimidine ribonucleotides; 7-deazapurine ribonucleotides; 2,6-diaminopurine 

ribonucleotides; and 2-thio-pyrimidine ribonucleotides, and nucleotides with other sugar groups 
(e.g. xylose). 

Oligonucleotides containing LNA are readily synthesized by standard phosphoramidite 
chemistry. The flexibility of the phosphoramidite synthesis approach further facilitates the easy 
25 production of LNA oligos carrying all types of standard linkers, fluorophores and reporter 
groups. 

Example 18: Selective Binding Complementary (SBC) nucleotides 

Selective Binding Complementary (SBC) nucleotides are unable to form stable hybrids 
30 with each other, yet are able to form stable, sequence-specific hybrids with complementary 

unmodified strands of nucleic acids. Thus, the reduced ability of SBC oligonucleotides to form 
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intramolecular hydrogen bond base-pairs between regions of substantially complementary 
sequence causes a reduced level of secondary structure. Self-complementarity is an important 
issue in nucleic acid technologies as reported for DNA, PNA and LNA, and in different 
biological applications especially in the field of homogeneous assays. LNA:LNA duplexes are 
5 the most thermally stable nucleic acid type duplex system known, making the reduction of self- 
complementarity even more important. 

Exemplary SBC oligonucleotides contain 2-amino-A (D) and ^^T incorporated in the 
same oligonucleotide as replacements of A and T, respectively. The SBC name refers to the fact 
that D and ^^T form a destabilised base-pair compared to the A-T base-pair, but D-T and ^^T-A 

10 base-pairs are normally more stable than the original A-T base -pair. Exemplary SBC-G 
nucleotides include inosine or LNA-inosine, and exemplary SBC-C nucleotides indue 
PyrroloPyr, LNA- PyrroloPyr, ^^C, and LNA-^^C (Fig. 3). Other exemplary SBC nucleotides are 
shown in Figs. 2 and 3. If desired, SBC nucleotides may be incorporated into the nucleic acids 
and arrays of the invention, using standard methods. 

15 The systems disclosed herein can provide significant nucleic acid probes for universal 

hybridization. In particular, universal hybridization can be accomplished with a 
conformationally restricted monomer, including a desirable pyrene LNA monomer. Universal 
hybridization behavior also can be accomplished in an RNA context. Additionally, the binding 
affinity of probes for universal hybridization can be increased by the introduction of high affinity 

20 monomers without compromising the base-pairing selectivity of bases neighboring the universal 
base. 

Incorporation of one or more modified nucleobases or nucleosidic bases into an 
oligonucleotide can provide significant advantages. Among other things, LNA oligonucleotides 
can often self-hybridize, rather than hybridize to another oligonucleotide. Use of one or more 
25 modified bases with the LNA units can modulate the propensity of the oligonucleotide to form 
double stranded structures with other oligonucleotides containing modified nucleobases 
including internal duplex formation, thereby inhibiting undesired self-hybridization. 
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Example 19: Exemplary Methods for Synthesizing LNA-2-thiopyrimidine Nucleosides and 
Nucleotides 

2-Thiopyrimidine nucleosides can be prepared in several ways as described below. For 
example, the 2-thiouridine-nucleosides (IV) can be synthesized from a substituted uridine 

5 nucleoside (VIII) as described in the scheme below. By protection of the 04-position (IX) on 
the nucleobase thionation can be performed, 02 position, which results in the 2-thio-uridine 
nucleoside (IV). Performing sulphurisation on both 02 and 04 results in 2,4-dithio-uridine 
nucleoside (X) which may be transformed into the 2-thio-uridine nucleoside (IV) (Saladino, et 
al, Tetrahedron, 1996, 52, 6759). Another way is to generate a cyclic ether (XI) through 

10 reaction with the 5' position this product can then be transformed to the 2-thio-uridine nucleoside 
(IV) or the 2-O-alkyl-uridine nucleoside (XII). The 2-0-alkyl-uridine nucleoside (XII) can also 
be generated by direct alkylation of the uridine nucleoside (VIII). Treatment of the 2-0-alkyl- 
uridine nucleoside (XII) can also be transformed into the 2-thio-uridine nucleoside (Brown et 
al, J. Chem. Soc. 1957, 868; Singer, et al, Proc. Natl. Acad. Sci. USA, 1983, 80, 4884; Rajur 

15 and McLaughlin, Tetrahedron Lett., 1992, 33, 6081). 


o 
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In another method, lewis acid-catalyzed condensation of a properly substituted sugar (I) 
and a substituted 2-thio-uracil (II) can result in a substituted 2-thio-uridine nucleoside of the 
structure (III) which by further synthetic manipulations can be transformed into the LNA 2- 
thiouridine nucleoside (IV) (Hamamura et. aL, Moffatt, J. Med. Chem., 1972, 15, 1061; Bretner 
5 et al, J. Med. Chem., 1993, 36, 361 1). 


O 



I II III 


O 



10 Using a properly substituted amino-sugar (V), a 2-thio-uridine nucleoside can be 

. synthesized through ring-synthesis of the nucleobase by reaction of the amino sugar (V) and an 
substituted isothiocyanate(VI), yielding the substituted LNA 2-thio-uracil nucleoside (VI) (Shaw 
and Warrener, J. Chem. Soc. 1957, 153; Cusack et al, J. Chem. Soc. Perkin 7, 1973, 1721). 
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Example 20: Exemplary Methods for Synthesizing ^*T-LNA 

Three different strategies for synthesis of ^^T-LNA are outlined in the Summary of the 
Invention section. Strategy A involves coupling a glycosyl-donor and a nucleobase, using 
standard methodology for synthesis of existing LNA monomers. Strategy B involves ring 
synthesis of the nucleobase. This strategy is desirable because the availability of 1 -amino-LN A 
enables introduction of a variety of new nucleobases. Strategy C includes modification of T- 
LNA; the easy synthesis of LNA-T diol makes this an attractive pathway. 

In a desirable embodiment, ^^T-LNA is synthesized as illustrated in the scheme below. 
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In particular, the known coupling sugar l,2-di-0-acetyl-3, 5 di-O-benzyl, 4-C- 
mesyloxymethyl, a,p-D-ribofuranose 1 was coupled with the nucleobase 2-thio-thymidine in a 
5 Vorbriiggen type reaction. Thus, the nucleobase was silyilated and condensed with the sugar 
using SnCU as catalyst to promote the reaction affording nucleoside 2. Mass spectrometry and 
NMR subsequently identified the isolated product as the desired one. NMR data were compared 
with published data of a 2-thio-thymindine derivative (Kuimelis and Nambiar, Nucleic Acid 
Res., 1994, 22, 1429-1436) in order to validate the correct attachment point of the nucleobase. 

1 0 Subsequently, a base mediated ring-closing reaction afforded the di-benzylated LNA 

derivative 3 in 77% yield. The signals in the ^H-NMR spectrum of the compound appeared as 
singlets, thus proving that the cyclization had occurred to give the LNA skeleton, in which the 
1 and 2'-H are perpendicular to each other causing the ^Jr,2' to be 0 Hz. MALDI mass 
spectrometry was likewise used for the identification of the compound. 

15 The LNA derivative was protected at the nucleobase with the toluoyl protective group to 

give 4. This group is well known for the protection of 2-thio-thymidine derivatives, (Kuimelis 
and Nambiar, Nucleic Acid Res., 1994, 22, 1429-1436). The protection of the nucleobase occurs 
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at both the N-3 and the 0-4 position and hence the compound is isolated as a mixture of two 
compounds. NMR shows that the ratio of the two isomers in the isolated mixture is 2: 1 . 
These methods are described further below. 


5 l-(2-0-acetyl-3-0, S-O-dibenzyl, 4-C-mesyloxymethyI-P-D-ribofuranosyl)-2-thio-thyinine 
(2) 

1, 2-di-0-acetyl-3, 5 di-O-dibenzyl, 4-C-mesyloxymethyl, a,P-D-ribofuranose (1, 2.0g, 
3.83 mmol) and 2-thio-thymine (552mg, 3.89mmol) were co-evaporated with anhydrous 
acetonitrile (100 ml) and redissolved in anhydrous acetonitrile (80ml), N.O- 

10 bistrimethylsilylacetamide (1.5, 5.85mmol) was added, and the reaction was stirred at 80°C for 
one hour. The mixture was cooled to 0°C, SnCU (0.9 ml, 7.66mmol) was added, and the reaction 
was left to stir for 24 hours. The reaction mixture was diluted with EtOAc and washed with 
NaHCOa and subsequently with water. The organic phase was dried (Na2S04) and evaporated to 
dryness. The product was purified using column chromatography, giving the thio-thymidine 

15 derivative 2 (l.lg, 1.82mmol, 40%) as a white foam. Rf (10% THF/dichloromethane): 0.75. 
MALDI-MS: 627 (M+Na) ^^C-NMR (CDCI3): 5= 174.40, 169.29, 159.89, 136.13, 136.51, 
136.05, 128.62, 128.56, 128.41, 128.29, 128.07, 127.89, 12767, 116.18, 91.41, 86.21, 75.59, 
75.31, 74.46, 74.22, 73.61, 69.25, 69.04, 37.52, 20.62, 11.91 

20 (iif,JJ?,^/?,75)-7-(ben2yIoxy)-l-(benzyloxymethyI)-3-(2-thiothymidine)-2,^^ 
dioxabicyc]o[2.2.1]heptane (3) 

1 -(2-0-acetyl-3-0, 5-0-dibenzyl, 4-C-mesyloxymethyl-P-D-ribofuranosyl)-2-thio- 
thymine (2, 630mg, 1.04mmol) was dissolved in dioxane (15ml) and water (8ml), and aqueous 
NaOH (2M, 5ml) was added, and the reaction was left to stir at room temperature for one hour. 

25 The yellow solution was neutralized with HCl (IM, 6ml) affording a precipitation. The mixture 
was diluted with dichloromethane and ethyl acetate causing an emulsion. After separation, the 
aqueous phase extracted with ethyl acetate, and the combined organic phase was dried (Na2S04) 
and evaporated to dryness. The compound was purified by column chromatography (0-2, then 
5% THF/dichloromethane), giving the ring closed compound 3 as a white foam (370mg, 

30 0.79mmol, 77%). Rf (2% MeOH/dichloromethane): 0.23. 
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MALDI-MS: 488 (M+Na) '^C-NMR (CDCI3): 8= 173.14, 160.39, 137.20, 136.63, 136.00, 
128.46, 128.34, 128.02, 127.66, 115.52, 90.29, 87.77, 77.39,75.26, 73.77, 72.07, 71.70, 64.15, 
30.17, 12.33 

^H-NMR (CDCI3): S= 9.87 (s, IH), 7.69 (d, l.lHz, IH), 7.26-7.37 (m, lOH), 6.13 (s, IH), 4.84 
5 (s, IH), 4.66 (d, J= 1 1.3 Hz, IH), 4.61 (s, 2H), 4.52 (d, 1 1.5Hz, IH), 4.04 (d, J=7.7Hz, IH), 3.93 
(s, IH), 3.88 (d, J= ll.OHz, IH), 3.82 (d, J= 7.7Hz, IH), 3.82 (d, J= 10.8 Hz, IH), 1.59 (d, J= 1.1 
Hz, 3H) 

(i/?,il?,^/f,75)-7-(benzyloxy)-l-(benzyloxyinethyl)-3-(2-thio-N3/04-toIuoyl- thymidine)-2,5- 
10 dioxabicyclo[2.2.1]heptane (4) 

(ii?,3i?,^/?,75)-7-(benzyloxy)-l<benzyloxymethyl)-3-(2-thiothymidine) 
dioxabicyclo[2.2.1]heptane (3, 290mg, 0.62mmol) was dissolved in anhydrous pyridine and 
diisopropylethylamine (0.2ml, 1.15nimol), toluoyl chloride (0.25ml, 1.89 mmol) was added, and 
the reaction mixture was stirred at room temperature for three hours. After completion, the 

15 reaction mixture was diluted with dichloromethane, and the reaction was quenched by addition 
of water. The phases were separated, and the organic phase was dried (Na2S04) and evaporate to 
dryness. The residue was co-evaporated with toluene. The product was purified by column 
chromatography (0-1% MeOH/dichloromethane) to give nucleoside 4 as a white foam (320 mg, 
0.55mmol, 89%). Rf (2%MeOH/dichloromethane): 0.78. 

20 MALDI-MS: 606 (M+Na) ^^C-NMR (CDCI3): 8=171.98, 168.30, 160.30, 145.92, 145.82, 
137.22, 136.65, 135.98, 130.39, 130.27, 129.85, 129.50, 128.51, 128.41, 128.08, 127.73, 115.11, 
90.10, 87.81, 76.01, 75.80, 75.39, 75.01, 73.83, 72.19, 72.09, 71.74, 64.15, 21.75, 12.40. 

Example 21: Exemplary Methods for Synthesizing LNA-I, LNA-D, and LNA-2AP 
25 2'-0, 4'-C-methylene linked (LNA) nucleosides containing hypoxanthine (or inosine) 

(LNA-I), 2,6-diaminopurine (LNA-D), and 2-aminopurine (LNA-2AP) nucleobases were 
efficiently prepared via convergent syntheses. The nucleosides were converted into 
phosphoramidite monomers and incorporated into LNA oligonucleotides using an automated 
phosphoramidite method. The complexing properties of oligonucleotides containing these LNA 
30 nucleosides were assessed against perfect and singly mismatch DNA. 
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Hypoxantine, the base found in the nucleotides inosine and deoxyinosine, is considered as a 
guanine analogue in nucleic acids. 
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Oligonucleotides containing 2,6-dianiinopurine replacements for adenines are expected to 
bind more strongly to their complementary sequences especially as part of A-type helixes due to 
the potential formation of three hydrogen bounds with thymine or uracil. The reported effect of 
2,6-diaminopurine deoxyriboside (D) on the stability of polynucleotide duplexes reaches, on 
5 average, about 1 .5 °C per modification. Higher stabilization effects for mismatches were 
observed for D nucleosides involved in formation of duplexes prone to form A-type helixes. 
LNA D and LNA 2'-0Me-D are expected to have increased stabilization and mismatch 
discrimination. LNA can be used in combination with 24hio-T for construction of selectively 
binding complementary oligonucleotides. Taking into consideration the extremely high stability 

10 of LNAiLNA duplexes, this approach might be very useful for constructing of LNA containing 
capture probes and antisense reagents including drugs. 

2-Aminopurine (2-AP) is a fluorescent nucleobase (emission at 363 mn), which is useful 
for probing nucleic acids structure and dynamics and for hybridizing with thymine in Watson- 
crick geometry. LNA-I, LNA-D, and/or LNA-2AP may be used in the nucleic acids of the 

1 5 present invention, e.g., to increase the priming efficiency of DNA oligonucleotides in PCR 
experiments and to construct selectively binding complementary agents. 
Synthesis of LNA-I 

The convergent method adopted for preparation of LNA monomers (Koshkin et aL, J. 
Org, Chem. 66:8504, 2001) was successfully applied for syntheses of the modified LNA 

20 nucleotides 1-3. The synthetic route to LNA-I phosphoramidite 11 is depicted in the scheme 
below. The previously described 4-C-branched furanose 4 (Koshkin et aL, supra) was used as a 
glycosyl donor in coupling reaction with silylated hypoxantine by the method of Vorbriiggen et 
al (Vorbriiggen et al , Chem. Ber. 114:1 234, 1981; Vorbriiggen et al , Chem. Ber. 114:1256, 
1981; and Vorbriigen, Acta Biochim. Pol., 43:25, 1996). The reaction resulted in high yield 

25 formation of desired P-configurated nucleoside derivative 5. However, analogous to the 

coupling reaction of 4 with protected guanines, the formation of undesired isomer (ratio of 
N-9/N'l =4:1) was also detected. The mixture of the isomers was used for the ring closing 
reaction and protected LNA nucleoside 6 was isolated in 68 % yield as a crystalline compound. 
The correct structure of the isolated isomer was confirmed later by chemical conversion of LNA- 

30 I into LNA-A nucleoside (vide infra). Deprotection of the 5 -hydroxy group of 6 was 

accomplished via two-step procedure developed for the syntheses of other LNA nucleosides ( 
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Koshkin et aL, supra). First, S'-O-mesyl group was displaced by sodium benzoate to produce 
nucleoside 7. The latter was converted into 5'-hydroxy derivative 8 after saponification of the 5'- 
benzoate. Direct removal of the 3'-0-benzyl group from compound 8 was unsuccessful under 
the conditions tested due to a solubility problem. Therefore, compound 8 was converted to 
5 DMT-protected nucleoside 9 prior to catalytic debenzylation of the 3'-0-hydroxy group. The 
phosphoramidite 11 was finally afforded via standard phosphitylation (McBride et aL, 
Tetrahedron Lett. 24:245, 1983; Sinha et al. Tetrahedron Lett. 24:5843, 1983; and Sinha et al. 
Nucleic Acids Res. 12:4539, 1984) of the nucleoside 10. In order to verify the correct 
orientation of the glycoside bond {N-9 isomer) in synthesized LNA-I nucleoside, compound 7 

10 was successfully converted into the known LNA-A derivative 13 (Koshkin et al, supra) 

(Scheme 2). Thus, a treatment of 7 with phosphoryl chloride according to the procedure reported 
by Martin (Helv. Chim. Acta 78:486, 1995) resulted in a high yield formation of 6-chloropurine 
derivative 12. The adenosine derivative 13 was derived from 12 after reaction with ammonia. 
Exemplary Analytical Data 

15 Data for compound 8 includes the following: mp 302-305 ""C (dec). NMR (DMSO- 

4): 88.16, (s, IH), 8.06 (s, IH), 7.30-7.20 (m, 5H), 5.95 (s, IH), 4.69 (s, IH), 4.63 (s, 2H), 4.28 
(s, IH), 3.95 (d,y=7.7, IH), 3.83 (m, 3H). '^C NMR (DMSO-^e): 5 156.6, 147.3, 146.1, 137.9, 
137.3, 128.3, 127.6, 127.5, 124.5, 88.2, 85.4, 77.0, 72.1, 71.3, 56.7. MALDI-MS m/z: (M+H)^. 
Anal. Calcd for C,8Hi8N405-5/12 H2O: C, 57,21; H, 5.02; N, 14.82. Found: C, 57,47; H, 4.95; 

20 N, 14.17. 

Analysis of compound 11 indicated that ^^P NMR (DMSO-^/e): 5 148.90. 
Scheme for Synthesis of LNA-I * 
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^ Keys: (a) hypoxantine, BSA, TMSOTf, 1,2-dichloromethane; 93%; (b) NaOH, 
THF, EtOH, H2O; 69%; (c) NaOBz, DMSO; 76%; (d) NaOH, THF, MeOH, H2O; 
85%; (e) DMT-Cl, pyridine; 92%; (f) Pd/C, HCO2NH4; 77%; (g) 2-cyanoethyl.7V,A^- 
diisopropyl-phosphoramidochloridite, DIPEA, DMF; 75%. 


Exemplary Experimental Conditions 

(7if,5/?,^if,75)-7-(2-Cyanoethoxy(diisopropyIamino)phosphinoxy)-l-(4,4'- 
dimethoxytrityIoxymethyl)-3-(hyroxanthin-9-yI)-dioxabicycIo[2,2.1]heptane (11) 

Compound 10 (530 mg, 0.90 mmol, described previously, (see for example, WO 
00/56746) was dissolved in anhydrous EtOAc (5 mL) and cooled in an ice-bath. DIPEA 
(0.47 mL, 2.7 mmol) and (250 |xL, 1.1 mmol) were added under intensive stirring. 
Formation of insoluble material was observed, and CH2CI2 (3 mL) was added to produce a 
clear solution. More 2-cyanoethyl-7y,//-diisopropylphosphoramidochloridite (200 |iL, 0.88 
mmol) was added after one hour, and the mixture was stirred overnight. EtOAc (30 mL) 
was added, the mixture was washed with sat. NaHCOs (2 x 50 mL), brine (50 mL), dried 
(Na2S04), and concentrated to a solid residue. Purification by silica gel HPLC (1-5 
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%MeOH/CH2Cl2 v/v, containing 0.1% of pyridine) gave compound 11 (495 mg, 75%) as a 
white solid material. NMR (DMSO-^/6): 5 148.90. 
Synthesis of LNA-D 

Taking advantage of a high availabiHty of the natural deoxy- and riboguanosines, a 
5 number of effective methods were developed for their conversion into 2,6-diaminopurine (D) 
nucleosides (Fathi et al. Tetrahedron Lett. 31 :319, 1990; Gryaznov et al. Tetrahedron Lett., 
35:2489, 1994; and Lakshman et al. Org. Lett., 2:927, 2000). However, the production of LNA- 
G nucleoside is a multi-step synthetic procedure. 

1 0 Scheme for Synthesis of LNA-G 


15 



For the synthesis of LNA-D nucleoside, a novel synthesis method was developed that 
employed a common convergent scheme, related to the strategy used earlier for the synthesis of 
20 its anhydrohexitol counterpart (Boudou et al. Nucleic Acids Res. 27:1450, 1999). In particular, 
a properly protected carbohydride unit was conjugated with 6-chloro-2-aminopurine to give a 
stable 6-chloro intermediate derivative (scheme below) which was further converted into desired 
diaminopurine nucleoside. 


CI 



sugar sugar sugar donor 


Thus, it was shown that glycosylation of 2-chloro-6-aminopurine with compound 4 
resulted in highly stereoselective formation of the nucleoside derivative 14. To promote the ring 
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closing reaction, a solution of 14 in aqueous 1,4-dioxane was treated with 10-fold excess of 
sodium 


Synthesis of LNA-D* 


CI 
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,P-0(CH2)2CN 


5 (PO2N 

' Keys: (a) 2-chloro-6-aminopurine, BSA, TMSOTf, 1 ,2-dichloromethane; 90 %; (b) 
NaOH, 1 ,4-dioxane, H2O; 87%; (c) NaOBz, DMF; (d) NaNs, DMSO; (e) NaOH, EtOH; 
79% (three steps); (f) 10% Pd/C, HCO2NH4, MeOH, H2O; 84%; (g) 1. BzCl, pyridine; 2. 
NaOH, EtOH, pyridine; 62%; (h) DMT-Cl, pyridine; 80%; (i) 2-cyanoethyl-iy,iV- 
10 diisopropylphosphoramidochloridite, DIPEA, DMF; 74%. 


hydroxide to give bicyclic compound 15 in 87% yield. The standard reaction with sodium 
benzoate in hot DMF was then successfully applied for displacement of 5'-mesylate of 15. 
Notably, this reaction proceeded in very selective manner and no side products originating from 


159 


the modification of the nucleobase were detected. The desired compound 16 was precipitated 
from the reaction mixture after addition of water. In order to introduce the 6-amino group into 
nucleobase structure, intermediate 6-azido derivative 17 was synthesized via reaction of 16 with 
sodium azide. The nucleoside derivative 18 was isolated as a crystalline compound after 
5 saponification of the 5'-benzoate of 17. Subsequent catalytic hydrogenation of 18 on palladium 
hydroxide resulted in simultaneous reduction of 6-azido and 3'-benzyI groups to give LNA-D 
diol 19 after crystallization from water. By the use of peracelation method, 2- and 6-amino 
groups of 19 were benzoylated at the next step to give the nucleobase protected derivative 20, 
which was in the standard way further converted into phosphoramidite monomer 21. 

10 This phosphoramidite has been produced in a quantity of 0.5 grams. 
Exemplary Analytical Data 

Data for compound 19 includes the following: NMR (DMS0-d6): 5 7.81 (s, IH), 6.78 
(br s, 2H), 5.91 (br s, 2H), 5.71 (s, IH), 5.66 (br s, IH), 5.04 (br s, IH), 4.31 (s, IH), 4.20 (s, 
IH), 3.90 (d, y = 7.7 Hz, IH), 3.77 (m, 2H), 3.73 (d, J= 7.7 Hz, IH). ^^C NMR(DMS0"^/6): 5 

15 160.5, 156.2, 150.9, 134.2, 113.4, 88.3, 85.0, 79.3, 71.5, 70.0, 56.8. MALDI-MS m/z: 295.0 

(M+H)^ Anal. Calcd for CiiHhN604-1.5 H2O: C, 41,12; H, 5.33; N, 26.15. Found: C, 41.24; H, 
5.19; N, 25.80. 

The^^P NMR (DMSO-de) spectrum for compound 24 contained signals at 5 149.19 and 

148.98. 

20 Data for compound 23 includes the following: crystallized from MeOH. mp. 227.5-229 

°C (dec). ^H NMR (DMS0-d6): 5 8.60 (s, IH), 8.15 (s, IH), 6.64 (br s, 2H), 5.82 (s, IH), 5.71 
(br s, IH), 5.04 (br s, IH), 4.40 (s, IH), 4.21 (s, IH), 3.92 (d, J= 7.7 Hz, IH), 3.79 (m, 2H), 3.75 
(d,y=7.7 Hz, IH). ^^CNMR(DMS0-4): 8 160.6, 152.0, 149.4, 139.3, 127.1, 88.6, 84.8, 79.1, 
71.6, 70.2, 56.8. MALDI-MS m/z: 334.7 (M+H)^ 

25 For protected compound 23, the ^^P NMR {DMSO-de) spectrum has a signal at 148.93 

and 148.85. 

Exemplary Experimental Conditions 

fJ5,ilf,^if, 75)-3-(2-amino-6-chloropurin-9-yI)-7-benzyloxy-l-methanesulfonoxymeth 
d]Oxabicyclo[2.2.1]heptane (15) 

30 To a solution of compound 14 (40 g, 64.5 mmol) in 1,4-dioxane (300 mL) was added 1 M 

NaOH (350 mL). The mixture was stirred for one hour at 0 ^^C, neutralized with AcOH (40 mL), 


and washed with CH2CI2 (2 x 200 mL). The combined organic layers were dried (Na2S04) and 
concentrated under reduced pressure. The solid residue was purified by silica gel flash 
chromatography to give compound 15 (27.1 g, 87%) as a white solid material NMR (CDCI3): 
5 7.84 (s, IH), 7.32-7.26 (m, 5H), 5.91 (s, IH), 4.73 (s, IH), 4.66 (d,y = 11.7 Hz, IH), 4.61 {d,J 
5 = 11.7 Hz, IH), 4.59 (s, 2H), 4.31 (s, IH), 4.18 (d, J = 8.0 Hz, 2H), 3.99 (d,y= 7.9 Hz, IH), 3.05 
(s, 3H). NMR (CDCI3) 5 158.9, 152.2, 151.4, 139.1, 136.4, 128.4, 128.2, 127.7, 125.3, 86.5, 
85.2, 77.2, 76.8, 72.4, 72.1, 64.0, 37.7. MALDI-MS m/z 482.1 [M+Hf . 

(7*5,5/?,^/?, ZS)-3-(2-amino-6-chloropurin-9-yl)-l-benzoyloxyniethyl-7-benzyloxy-2,5- 
1 0 dioxabicyclo[2.2.1]heptane (16) 

A mixture of sodium benzoate (7.78 g, 54 mmol) and compound 15 13 g, 27 mmol) was 
suspended in anhydrous DMF (150 mL) and stirred for two hours at 105 °C. Ice-cold water (500 
mL) was added to the solution under intensive stirring. The precipitate was filtered off, washed 
with water, and dried in vacuo. The intermediate product 16 (8 g) was used for ext step without 

15 further purification. Analytical sample was additionally purified by silica gel HPLC (0-2% 
MeOH/CH2Cl2 v/v). ^HNMR (CDCI3) 57.98-7.95 (m, 2H), 7.79 (s, IH), 7.62-7.58 (m, IH), 
7.48-7.44 (m, 2H), 7.24 (m, 5H), 5.93 (s, IH), 4.80 (d, J= 12.6 Hz, IH), 4.77 (s, IH), 4.67 (d,/ 
= 11.9 Hz, IH), 4.65 (d,y= 12.6 Hz, IH), 4.56 (d, 7= 11.9 Hz, IH), 4.27 (d,y= 8.0 Hz, IH), 
4.25 (s, IH), 4.08 (d,y= 7.9 Hz, IH). *^CNMR (CDCI3) 6 165.7, 158.8, 152.1, 151.3, 138.9, 

20 136.4, 133.4, 129.4, 129.0, 128.5, 128.4, 128.2, 127.6, 125.4, 86.4, 85.7, 77.2, 76.7, 72.5, 72.3, 
59.5. MALDI-MS m/z 508.0 [M+Hf. 

^75,i/?,^/?,75)-3-(2-amino-6-azidopurin-9-yl)-7-benzyIoxy-l-hydroxymethyl-2,5- 
dioxabicyclo[2.2.1]heptane (18) 

All the amount of compound 16 from the previous experiment was dissolved in 
25 anhydrous DMSO (100 mL) and NaNa (5.4 g, 83 mmol) was added. The mixture was stirred for 
two hours at 100 °C and cooled to room temperature. Water (400 ml) was added, and the mixture 
was stirred for 30 minutes at 0 (ice-bath) to give a yellowish precipitate 17. The precipitate 
was filtered off, washed with water, and dissolved in THF (25 mL). 2M NaOH (30 mL) was then 
added to the solution, and after 1 5 minutes of stirring the mixture was neutralized with AcOH (4 
30 mL). The mixture was concentrated to approximately 1/2 of its volume and cooled in an ice-bath. 
The titel compound was collected by filtration, washed with cold water, and dried in vacuo. 
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Yield: 8.8 g (79% from 15). NMR (DMSO-^) 6 8.53 ( br s, 2H), 8.23 (s, IH), 7.31-7.26 (m, 
5HX 6.00 (s, IH), 5.26 (t, 7=5.7 Hz, IH), 4.76 (s, IH), 4.64 (s, 1HX4.31 (s, IH), 3.99 (d,y = 
7.9 Hz, IH), 3.88-3.85 (m, 3H). ^^CNMR (DMSO-de) 5 146.0, 144.0, 143.8, 137.9, 137.0, 
128.3, 127.7, 127.6, 1 12.3, 88.3, 85.6, 77.1, 77.0, 72.2, 71.4, 56.8. MALDI-MS m/z 384.7 
5 [M+H]"^ for 2,6-diaminopurine product, 410.5 [M+H]"". Anal. Calcd for CigHig N8O4: C, 52.68; 
H, 4.42; N, 27.30. Found: C, 52.62; H, 4.36; N, 26.94. 
^/5,5/?,-//?,75)-3-(2,6-Diaminopurin-9-yl)-7-hydroxy-l-hydroxymethy^^^ 
dioxabicyclo[2.2.1]heptane (19) 

To a suspension of compound 18 (8 g, 19.5 mmol) in MeOH (100 mL) were added 

10 Pd(0H)2/C (20%, 5.5 g) and HCO2NH4 (3g). The mixture was refluxed for 30 minutes and more 
HCO2NH4 (3g) was added. After refluxing for further 30 minutes, the catalyst was filtered off 
and washed with boiling MeOH/HaO (1/1 v/v, 200 mL). The combined filtrates were 
concentrated to approximately 100 mL and cooled in an ice-bath. The precipitate was filtered off, 
washed with ice-cold H2O and dried in vacuo to give compound 19 (5.4 g, 94 %) as a white solid 

15 material. 'H NMR (DMSO-d6): 6 7.81 (s, IH), 6.78 (br s, 2H), 5.91 (br s, 2H), 5.71 (s, IH), 5.66 
(br s, IH), 5.04 (br s, IH), 4.31 (s, IH), 4.20 (s, IH), 3.90 (d, J = 7.7 Hz, IH), 3.77 (m, 2H), 3.73 
(d,J=7.7Hz, IH). ^^C NMR(DMSO-4)5 160.5, 156.2, 150.9, 134.2, 113.4, 88.3,85.0, 79.3, 
71.5, 70.0, 56.8. MALDI-MS m/z: 295.0 (M+H)^ Anal. Calcd for C11H14N6O4 I.5 H2O: C, 
41,12; H, 5.33; N, 26.15. Found: C, 41.24; H, 5.19; N, 25.80. 

20 (i5,Jif,^/?,75)-3-(2,6-Di-(iV-benzoyIamino)purin-9-yl)-7-hydroxy-l-hydroxyme 
dioxabicyclo[2.2.1]heptane (20) 

A solution of compound 19 (0.5 g, 1.7 mmol) in anhydrous pyridine (20 mL) was cooled 
in an ice-bath and benzoyl chloride (1.5 mL, 12.9 mmol) was added under intensive stirring. The 
mixture was allowed to warm to room temperature and was stirred overnight. Ethanol (20 mL) 

25 and 2 M NaOH (20 mL) were added, and the mixture was stirred for an additional hour. EtOAc 
(75 mL) was added and the solution was washed with water (2 x 50 mL). The combined aqueous 
layers were washed with CH2CI2 (2 x 50 mL). The combined organic phases were dried 
(Na2S04) and concentrated under reduced pressure to a solid residue. The residue was suspended 
in Et20 (75 mL, under refluxing for 30 minutes) and cooled in an ice-bath. The product was 

30 collected by filtration, washed with cold Et20, and dried in vacuo to give compound 20 (530 mg, 
62 %) as a slightly yellow solid material. 
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fi/?,ii?,-//?,75)-3-(2,6-Di-(A^-benzoyIamino)purin-9-yI).l-(4,4'^ 
hydroxy-2,5-dioxabicyclo[2.2.1]heptane (21) 

Compound 20 (530 mg, L06 mmol) was co-evaporated with anhydrous pyridine (2 x 20 
mL) and dissolved in anhydrous piridine (10 mL). DMT-Cl (600 mg, 1.77 mmol) was added, and 
5 the solution was stirred overnight at rt. The mixture was diluted with EtOAc (100 mL), washed 
with saturated NaHCOs (100 mL) and brine (50 mL). Organic layer was dried over Na2S04 and 
concentrated under reduced pressure. Purification by silica gel HPLC (20-1 00% EtOAc/hexane 
v/v, containing 0.1 % of pyridine) gave compound 21 (670 mg, 79%) as a white solid material. 
NMR (CD3OD): 5 8.41 (s, IH), 8.15-8.03 (m, 4H), 7.71-7.22 (m, 15H), 6.92-6.86 (m, 4H), 
10 6.23 (s, IH), 4.77 (s, IH), 4.62 (s, IH), 4.03 (d,y= 7.9 Hz, IH), 3.99 (d,/= 7.9 Hz, IH), 3.79 (s, 
6H), 3.67 (d,/= 10.9 Hz, IH), 3.54 (d,y= 10.8 Hz, IH),. MALDI-MS m/z: 826 (M+Na)^ Anal. 
Calcd for C46H4oN608-H20: C, 67.14; H, 5.14; N, 10,21. Found: C, 67.24; H, 4.97; N, 10.11. 
(il?,i/f,^/?,75)-7-(2-Cyanoethoxy(diisopropylamino)phosphinoxy)-3-(2,6-di-(7V- 
benzoylamino)purin-9-yl)-l-(4,4*-dimethoxytntyIoxymethyl)-2,5- 
1 5 d]oxabicyclo[2.2.1]heptane (21) . 

To a stirred solution of compound 20 (640 mg, 0.8 mmol) in anhydrous DMF (5 mL) 
were added DIPEA (420 L, 2.4 mmol) and 2-cyanoethyl-A^,A^- 

diisopropylphosphoramidochloridite (300 )xL, 1.2 mmol). The mixture was stirred for 1.5 hours 

at room temperature, diluted with EtOAc (100 mL), and washed with saturated NaHCOs (2 x 
20 100 mL) and brine (50 mL). Organic layer was dried (Na2S04) and concentrated under reduced 

pressure to give a yellow solid residue. Purification by silica gel HPLC (20-100 % 

EtOAc/hexene containing 0.1 % of pyridine) gave compound 21 (590 mg, 74%) as a white solid 

material. ^'P NMR (DMS0-d6) 5 149.19, 148.98. 

Synthesis of Pac-protected LNA-D amidite 
25 The following scheme illustrates a method for synthesizing a Pac-protected version of 

LNA-D amidite. 
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I — 20: R=H 

I— ^ 21 : R = P(0(CH2)2CN)N(iPr) 2 


Compound 17 

Compound 7 (Ig, 3.39 mmol) was co-evaporated with anhydrous DMF (2x10 mL) and 
dissolved in DMF (10 mL). Imidazole (0.69 g, 10.17 mmol) and l,3-dichloro-l,l,33- 
tetraisopropyldisiloxane (1.4 mL, 4.37 mmol) were added, and the mixture was stirred ovemight. 
H2O (100 mL) was added under intensive stirring to precipitate nucleoside material. The 
precipitate was filtered off, washed with H2O, and dried in vacuo. Crystallization from ethanol 
gave compound 17 (1.15 g, 63%) as a white solid material. MALDI-MS: m/z 537.3 (M+H)^. 

Compound 18 

To a solution of compound 17 (1.15 g, 2.14 mmol) in anhydrous pyridine (5 mL) was 
added phenoxyacetic anhydride (2 g, 7.0 mmol) and the mixture was stirred for four hours. 
EtOAc (100 mL) was added, and the solution was washed with sat. NaHCOa (2 x 100 mL), brine 
(50 mL), dried (Na2S04), and concentrated to a solid residue. Purification by silica gel HPLC 
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(50-100% v/v EtOAc/hexane) gave compound 18 (1.65 g, 95%) as a white solid material. 
MALDI-MS: m/z 827.3 (M+Na)". 

(75,5/?,^if,74S)-3-(2,6-Di-(A^-phenoxyacetylamino)purin-9-yl)-7-hydroxy-l-hydr^ 
2,5-dioxabicycIo[2.2.1]heptane (19) 
5 To a solution of compound 18 (0.96 g, 1.19 mmol) in anhydrous THF (10 mL) was added 

EtaN 3HF (0.2 mL) and the mixture was stirred overnight at room temperature. The formed 
precipitate was collected by filtration and washed with THF (5 mL) and pentane (5 mL) to give 
after drying compound 19 (650 mg, 97%) as a white solid material. MALDI-MS: m/z 563.0 
(M+H)^. 

10 (7/f,i/f,^7f,75)-3-(2,6-Di-(A^-phenoxyacetylamino)-purin-9-yl)-l-(4,4'- 

dimethoxytrityloxymethyl)-7-hydroxy-2,5-dioxabicyclo[2.2.1]heptane (20) 

To a solution of compound 19 (650 mg, 1.15 mmol) was added DMT-Cl (500 mg, 1.48 
mmol). The mixture was stirred for five hours, diluted with EtOAc (100 mL), and washed with 
sat, NaHCOa (2 x 100 mL). The organic layer was dried and concentrated to a solid residue. 

15 Crystallization from EtOAc gave compound 20 (810 mg, 81%) as a white solid material. 
(iif,J/?,^/f,75)-7-(2-Cyanoethoxy(diisopropylamino)phosphinoxy)-3--(2,6-di-(A^- 
phenoxyacetylamino)-purin-9-yl)-l-(454'-dimethoxytrityloxymethyl)-2,5- 
dioxabicyclo[2.2.1]heptane (21) 

To a solution of compound 20 (800 mg, 0.92 mmol) in anhydrous DMF (10 mL) were 

20 added 0.75 M solution of DCI in EtOAc (0.7 mL) and 2-cyanoethyl 

tetraisopropylphosphorodiamidite (0.32 mL, 1.01 mmol). The mixture was stirred at room 
temperature ovemight and EtOAc (75 mL) was added. The resulting solution was washed with 
sat. NaHCOs and brine, dried and concentrated to a solid residue. Purification by silica gel HPLC 
(30-100% v/v EtOAc/hexane, containing 0.1% of pyridine) gave phosphoramidite 21 (550 mg, 

25 56%) as a white solid material. 

^*PNMR(DMS0-J6): 5 149.08, 148.8. 
Synthesis ofLNA-lAP 

The intermediate derivative 16 was also used for the synthesis of LNA-2AP nucleoside. 
First, the 5'-0-benzoyl group of 16 was hydrolyzed by aqueous sodium hydroxide to give the 

30 nucleoside derivative 22 in 72% yield. The conditions of catalytic transfer hydrogenation 
usually used for removal of the 3'-0-benzyl group turned out to be suitable for complete 
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dechlorination of the nucleobase of 22. Thus, totally deprotected LNA-2AP nucleoside 23 was 
afforded in high yield after refluxing of the methanolic solution of 22 in the presence of 
paladium hydroxide and ammonium formate. The 2-amine of 23 was selectively protected with 
an amidine group after treatment with A^,A^-dimethylformamide dimethyl acetal. The resulting 
5 diol 24 was then 5'-0-DMT protected and 3'-0-phosphitylated to yield the desired 

phosphoramidite LNA-2AP monomer 25 (McBride et al, J. Am. Chem. Soc. 108:2040, 1986). 

Synthesis of LNA-2AP^ 



,P-0(CH2)2CN 

10 

'Keys: (a) NaOH, 1,4-dioxane, H2O; 72%; (b) 20% Pd(OH)2/C, HCO2NH4, MeOH, 
H2O; 89%; (c) 7V;Ar-dimethylformamide dimethyl acetal, DMF; (d) DMT-Cl, 
pyridine; 87% (two steps); (e) 2-cyanoethyl-MiN^- 
15 diisopropylphosphoramidochloridite, DIPEA, DMF; 64%. 

Exemplary Experimental Conditions 

fJ5',iif,^i?,75)-3-(2-amino-6-chIoropurin-9-yl)-7-benzyloxy-l-hydroxymethyl-2,5- 
dioxabicyclo[2.2.1]heptane (22) 

20 To a solution of compound 16 (3 g, 5.92 mmol) in 1,4-dioxane (20 mL) was added 2 M 

NaOH (20 mL) and the mixture was stirred for one hour. AcOH (3 mL) was added, and the 
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solvents were removed under reduced pressure. The solid residue was re-dissolved in 20% 
MeOH/EtAc (50 mL), washed with NaHCOa (2 x 50 mL), dried (Na2S04) and concentrated to a 
solid residue. The residue was purified by silica gel column chromatography (1-2% MeOH/EtAc 
v/v) to give compound 22 (1 .72 g, 72%) as a white solid material. 
5 ('i5',ii?,^/?,75)-3-(2-aminopurin-9-yI)-7-hydroxy-l-hydroxyniethyl-2,5- 
dioxabicyclo[2.2.1]heptane (23) 

To a solution of compound 22 (0.72 g, 1.79 mmol) in MeOH/dioxane (1/1 v/v) were 
added Pd(0H)2/C (20%, 0.5 g) and HCO2NH4 (1.5 g, 23.8 mmol). The mixture was stirred under 
refluxing for 30 minutes and cooled to room temperature. The catalyst was filtered off and 

10 washed with MeOH. The combined filtrates were concentrated under reduced pressure to yield 
compound 23 (0.44 g, 89 %) as a white solid material. Analytical sample was crystallized from 
MeOH. mp. 227.5-229 (dec). NMR (DMSO-de): 6 8.60 (s, IH), 8.15 (s, IH), 6.64 (br s, 
2H), 5.82 (s, IH), 5.71 (br s, IH), 5.04 (br s, IH), 4.40 (s, IH), 4.21 (s, IH), 3.92 (d, J= 1.1 Hz, 
IH), 3.79 (m, 2H), 3.75 (d,y= 7.7 Hz, IH). ^^C NMR(DMS0-rf6): 5 160.6, 152.0, 149.4, 139.3, 

15 127.1, 88.6, 84.8, 79.1, 71.6, 70.2, 56.8. 

(7/?,5if,^i?,75)-l-(4,4'-dimethoxytrityIoxymethyl)-3-(2-7V- 

(dimethylaminomethyIidene)aminopurin-9-yI)-7-hydroxy-2,5-dioxabicyclo[2.2.1]heptane 
(5' DMT protected version of 24) 

Compound 23 (0.4 g, 1.43 mmol) was co-evaporated with anhydrous DMF (10 mL) and 
20 dissolved in DMF (15 mL). A/,A^-Dimethylformamide dimethylacetal (0.8 mL) was added and the 
solution was stirred for three days at room temperature. Water (5 mL) was added, and the 
solvents were removed under reduced pressure. The solid residue was co-evaporated with 
anhydrous pyridine (2x10 mL) and dissolved in anhydrous pyridine (5 mL). DMT-Cl (0.7 g, 2.1 
mmol) was added, the solution was stirred for four hours, diluted with EtOAc (50 mL), and 
25 washed with NaHCOs (2 x 50 mL) and brine (50 mL). Organic layer was dried (Na2S04) and 
concentrated to a yellow solid residue. Purification by silica gel HPLC (1-6% MeOH/CH2Cl2 
v/v, containing 0.1% of pyridine) gave the 5' DMT protected version of compound 24 (0.87 g, 
87%) as a white solid material. 


167 


(7/?,i/f,^if,ZS)-7-(2-Cyanoethoxy(diisopropyIainino)phosphinoxy)-l-(4,4*- 
dimethoxytrityloxymethyl)-3-(2-A^-(dimethyIaminomethylidene)aminopurin-9-yI) 
dioxabicyclo[2.2.1]heptane (25) 

The 5' DMT protected version of compound 24 (0.5 g, 0.79 nunol) was dissolved in 
5 anhydrous DMF (10 mL) and DIPEA (350 ^iL) and 2-cyanoethyl-A/,A^- 

diisopropylphosphoramidochloridite (250 ^iL) were added. The mixture was stirred for one hour, 
diluted with EtOAc (50 mL), washed with saturated NaHCOa (2 x 100 mL) and brine (50 mL), 
dried (Na2S04), and concentrated to a solid residue. Purification by silica gel HPLC (0-3% 
MeOH/CH2Cl2 v/v, containing 0:1% of pyridine) gave compound 25 (0.42 g, 64%) as a white 
10 solid material. ^^P NMR (DMSO-c/e) 5 148.93, 148.85. 

Synthesis of Oligomers 

Along with previously described LNA phosphoramidites (Koshkin et al.^ supra; and 
Pedersen et al. Synthesis p. 802, 2002), the phosphoramidite monomers 11, 21, and 25 were 

15 successfully applied for automated oligonucleotide synthesis (Caruthers, Acc. Chem. Res. 

24:278, 1991) to produce the LNA oligomers depicted in Table 9. Oligonucleotide syntheses 
were performed on a 0.2 jxmol scale using an Expedite synthesizer (Applied Biosystems) with 
the recommended commercial reagents. Standard protocols for DNA synthesis were used, 
except that the coupling time was extended to 5 minutes and the oxidation time was extended to 

20 30 second cycles. Deprotection of the oligonucleotides were performed by treatment with 

concentrated ammonium hydroxide for five hours at 60 **C. After that, the LNA-D containing 
oligonucleotides were additionally treated with AMA (concentrated ammonium hydroxide / 40% 
aqueous MeNH2; 1/1 v/v) for one hour at 60'*C. All the synthesized oligonucleotides were 
purified by RP-HPLC, and their structures were verified by MALDI-TOF mass spectra. 

25 The complexing properties of oligonucleotides containing new LNA monomers 1-3 were 

assessed. Comparative binding data from an 8-mer LNA sequence is shown in Table 9 as the 
melting temperatures against complementary single stranded DNA. An exemplary sequence for 
this comparison is GACATAGG, which is the central part of a capture probe used for SNP 
detection in GluclVS7-7asA (A:a mismatch position). The thermal stabilities of reference DNA 

30 duplexes (entries 1-7, Table 9) can be directly compared with their LNA counterparts (entries 8- 
14). The hybridizing ability of all LNA 8-mers is superior to that of isosequencial DNA 
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oligonucleotides. The average melting temperatures of DNA and LNA 8-mers against 
complementary DNAs typically differ by about 40 °C. The replacement of one internal LNA-A 
nucleotide by LNA-D resulted in the further stabilization of the complementary duplex (i.e., 
compare entries 8 and 1 1) by 6.2 ""C. Interestingly, the analogous replacement made in an DNA 

5 octamer destabilized the corresponding duplex by 0.5 °C (i.e., entries 1 and 4). D-nucleosides 
may facilitate a B to A helix transition, because the A-type structure of an LNA:DNA duplex is 
more suitable for effective D:t pairing. This stabilizing effect is expected to be even more 
pronounced for LNA:RNA duplexes, which can be very useful for construction of antisense or 
other gene-silencing reagents. The mismatch discrimination ability of the D-nucleoside was also 

10 studied (entry 11). In comparison to LNA-A (entry 8) D-nucleoside demonstrated remarkable 
increased mismatch discrimination against DNA-g nucleoside. 

Table 9. Melting temperatures (Tm) of the complementary DNA-DNA and LNA-DNA 
duplexes.^ Modified monomers (LNA are in CAPITALS): I = inosine; D = 2,6-diaminopurine; X 
1 5 = 2-aminopurine. 


Entry 

Oligonucleotide 

structure 

Tm (± 0.5 °C) of the duplexes with complementary 
deoxynucleotide 

3'-ctgtatcc 

3'-ctgaatcc 

3'-ctggatcc 

3'-ctgcatcc 

1 

5'-gacatagg 

23.8 

<10 

<10 

<10 

2 

5'-gacttagg 

<10 

22.6 

<10 

<10 

3 

5'-gacgtagg 

<10 

<10 

<10 

25.0 

4 

5'-gacdtagg 

23.3 

<10 

<10 

<10 

5 

5'-gdcdtdgg 

33.4 

<10 

<10 

17.7 

6 

5'-gacitagg 

<10 

<10 

<10 

20.9 

7 

5'-gacxtagg 

<10 

<'i6 

<10 

<10 

S 

5'-GACATAGG 

61.6 

38.2 

43.4 

40.6 

9 

5'-GACTTAGG 

28.0 

60.7 

36.4 

23.5 

10 

5'-GACGTAGG 

55.0 

32^ 


70.9 

11 

5'-GACDTAGG 

67.8 

42.2 

41.4 

52.4 
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Tm (± 0.5 "C) of the duplexes with complementary 
deoxynucleotide 

12 

5*-GDCDTDGG 

78.3 

55.9 

54.7 

63.8 

13 

5'-GACITAGG 

53.1 

48.2 

43.0 

59.9 

14 

5'-GACXTAGG 

60.8 

45.5 

44.0 

53.9 


^ The melting temperatures (Tm values) were obtained as a maxima of the first derivative of the 
corresponding melting curves (optical density at 260 nm versus temperature). Concentration of 
the duplexes: 2.5 [iM. Buffer: 0.1 M NaCl; 10 mM Na-phosphate (pH 7.0); 1 mM EDTA. 
5 ^ Low cooperativity of transitions (accuracy ± 1 °C). 


Table 10. The mismatch discrimination effect of the chimeric LNA-DNA 12-mers containing 
LNA-A or LNA-D nucleosides against the point of mutation (SEQ ID NOs: ). 


The structure of 

LNA-DNA 

oligonucleotide 

Tm (± 0.5 °C) of the complementary duplexes with DNA 
oligonucleotides (A Tm between singly mismatched and perfect 
duplexes) 

HNFasl28A-2 


caacatcccaca 

caacaacccaca 

tGtggGATGttg 

61.0 

45.9 (-15.1) 

tGtggGDTGttg 

65.5 

49.7 (-15.8) 

Gluc53as-A 


aagagtccagtg 

aagaggccagtg 

cAmCtgGAmCtctt 

61.5 

50.6 (-10.9) 

cAmCtgGDmCtctt 

65.3 

45.4 (-19.9) 


10 

^ Concentration of duplexes: 2 |iM; Buffer: see Table 9. 

Table 11. Melting temperatures of the LNA and DNA duplexes (LNAs are CAPITALIZED) 
containing 2-thio-deoxythymidine (s) and diaminopurineriboside (d). See Table 9 for 
15 experimental conditions. 
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oligo structure 

Tm (± 0.5 "C ) of complementary duplexes with 

3'-ctgtatcc 

3'-ctgsatcc 

3'-CTGsATCC 

3'-CTGtATCC 

3'-CTGTATCC 

5'-gacatagg 

23.8 

27 

54.4 

49.4 

54.6 

5'-gacdtagg 

23.3 

<6 

45,4 

55.2 

60.5 

5'-GACATAGG 

61.6 

64.6 

87" 

88 

88 

5'-GACDTAGG 

67.8 

59.4 

80 

>90 

>90 


* Tm values in the shaded cells were measured in low salt buffers (1 mM Na-phosphate, pH 7.0). 
Low cooperativity of the transitions was observed (accuracy ±1 .5°C) 

5 Example 22: Exemplary Methods for Synthesizing LNA-PyrroIoPyr-SBC-C 

The furanopyrimidine phosphoramidite 6pC used for incorporation of the pyrroloC 
analogue can be synthesized from LNA-U through a series of reactions as illustrated below and 
in Fig. 6. Starting from LNA-U IpC iodine can be introduced on the 5 position on the 
nucleobase (Chang and Welch, J. Med. Chem. 1963, 6, 428). This compound can be used in a 

1 0 Sonogashira type palladium coupling reaction (Sonogashira, Tohda and Hagihara, Tetrahedron 
Lett. 1975, 4467) resulting in the 5-ethynyl-LNA-U 3pC. The 5-ethynyl-LNA-U 3pC can be 
transformed to the furanopyrimidie LNA analogue 4pC when reacted with Cul, and then 
transformed into the DMT-protected phosphoramidite 6pC (Woo, Meyer, and Gamper, Nucleic 
Acids Res., 1996, 24, 2470). LNA-PyrroloPyr-SBC-C is formed when 6pC or an 

1 5 oligonucleotide containing 6pC is deprotected with ammonia. 
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6pC 

Example 23: Exemplary Modified Bases such as Universal Bases 


Desirable modified bases are covalently linked to the I'-position of a furanosyl ring, 
particularly to the T-position of a 2*,4'-linked furanosyl ring, especially to the T-position of a 2- 
0,4*-C-methylene-beta-D-ribofiiranosyl ring. 
5 As discussed above, other desirable modified bases contain one or more carbon alicyclic 

or carbocyclic aryl units, i.e. non-aromatic or aromatic cyclic units that contain only carbon 
atoms as ring members. Modified bases that contain carbocyclic aryl groups are generally 
desirable, particularly a moiety that contains multiple linked aromatic groups, particularly groups 
that contain fused rings. That is, optionally substituted polynuclear aromatic groups are 
10 especially desirable such as optionally substituted naphthyl, optionally substituted anthracenyl, 
optionally substituted phenanthrenyl, optionally substituted pyrenyl, optionally substituted 
chrysenyl, optionally substituted benzanthracenyl, optionally substituted dibenzanthracenyl, 
optionally substituted benzopyrenyl, with substituted or unsubstituted pyrenyl being particularly 
desirable. 

15 Without being bound by any theory, it is believed that such carbon alicyclic and/or 

carbocyclic aryl modified bases can increase hydrophobic interaction with neighboring bases of 
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an oligonucleotide. Those interactions can enhance the stability of a hybridized oligo pair, 
without necessity of interactions between bases of the distinct oligos of the hybridized pair. 

Again without being bound by any theory, it is further believed that such hydrophobic 
interactions can be particularly favored by platelike stacking of neighboring bases, i.e. 
5 intercalation. Such intercalation will be promoted if the base comprises a moiety with a 
relatively planar extended structure, such as provided by an aromatic group, particularly a 
carbocyclic aryl group having multiple fused rings. This is indicated by the increases in Tm 
values exhibited by oligos having LNA units with pyrenyl nucleobases relative to comparable 
oligos having LNA units with naphthyl nucleobases. 

10 Modified bases that contain one or more heteroalicyclic or heteroaromatic groups also are 

suitable for use in LNA units, particularly such non-aromatic and aromatic groups that contains 
one or more N, O or S atoms as ring members, particularly at least one sulfur atom, and fi-om 5 
to about 8 ring members. Also desirable is a nucleo base that contains two or more fused rings, 
where at least one of the rings is a heteroalicyclic or heteroaromatic group containing 1, 2, or 3 

15 N, O, or S atoms as ring members. 

In general, desirable are modified bases that contain 2, 3, 4, 5, 6, 7 or 8 fused rings, 
which may be carbon alicyclic, heteroalicyclic, carbocyclic aryl and/or heteroaromatic; more 
desirably modified bases that contain 3, 4, 5, or 6 fused rings, which may be carbon alicyclic, 
heteroalicyclic, carbocyclic aryl and/or heteroaromatic, and desirably the fused rings are each 

20 aromatic, particularly carbocyclic aryl. 

In some embodiments, the base is not an optionally substituted oxazole, optionally 
substituted imidazole, or optionally substituted isoxazole modified base. 

Other suitable modified bases for use in LNA units in accordance with the invention 
include optionally substituted pyridyloxazole, optionally substituted pyrenylmethylglycerol, 

25 optionally substituted pyrrole, optionally substituted diazole and optionally substituted triazole 
groups. 

Desirable modified bases of the present invention when incorporated into an 
oligonucleotide containing all LNA units or a mixture of LNA and DNA or RNA units will 
exhibit substantially constant Tm values upon hybridization with a complementary 
30 oligonucleotide, irrespective of the bases present on the complementary oligonucleotide. 
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In some embodiments, one or more of the common RNA or commonly used derivatives 
thereof, such as 2'-0-methyl, 2*-fluoro, 2'-allyl, and 2'-0-methoxyethoxy derivatives are 
combined with at least one nucleotide with a universal base to generate an oligonucleotide 
having between five to 1 00 nucleotides. 
5 Modified nucleic acid compounds may comprise a variety of nucleic acid units e.g. 

nucleoside and/or nucleotide units. As discussed above, an LNA nucleic acid unit has a carbon 
or hetero alicyclic ring with four to six ring members, e.g. a furanose ring, or other alicyclic ring 
structures such as a cyclopentyl, cycloheptyl, tetrahydropyranyl, oxepanyl, tetrahydrothiophenyl, 
pyrrolidinyl, thianyl, thiepanyl, piperidinyl, and the like. 

10 In an aspect of the invention, at least one ring atom of the carbon or hetero alicyclic 

group is taken to form a further cyclic linkage to thereby provide a multi-cyclic group. The 
cyclic linkage may include one or more, typically two atoms, of the carbon or hetero alicyclic 
group. The cyclic linkage also may include one or more atoms that are substituents, but not ring 
members, of the carbon or hetero alicyclic group. 

15 Unless indicated otherwise, an alicyclic group as referred to herein is inclusive of group 

having all carbon ring members as well as groups having one or more hetero atom (e.g. N, O, S 
or Se) ring members. The disclosure of the group as a "carbon or hetero alicyclic group" further 
indicates that the alicyclic group may contain all carbon ring members (i.e. a carbon alicyclic) or 
may contain one or more hetero atom ring members (i.e. a hetero alicyclic). Alicyclic groups are 

20 understood not to be aromatic, and typically are fully saturated within the ring (i.e. no endocyclic 
multiple bonds). 

Desirably, the alicyclic ring is a hetero alicyclic, i.e. the alicyclic group has one or more 
hetero atoms ring members, typically one or two hetero atom ring members such as O, N, S or 
Se, with oxygen being often desirable. 

25 The one or more cyclic linkages of an alicyclic group may be comprised completely of 

carbon atoms, or generally more desirable, one or more hetero atoms such as O, S, N or Se, 
desirably oxygen for at least some embodiments. The cyclic linkage will typically contain one or 
two or three hetero atoms, more typically one or two hetero atoms in a single cyclic linkage. 

The one or more cyclic linkages of a nucleic acid compound of the invention can have a 

30 number of alternative configurations and/or configurations. For instance, cyclic linkages of 

nucleic acid compounds of the invention will include at least one alicyclic ring atom. The cyclic 
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linkage may be disubstituted to a single alicyclic atom, or two adjacent or non-adjacent alicyclic 
ring atoms may be included in a cyclic linkage. Still further, a cyclic linkage may include a 
single alicyclic ring atom, and a further atom that is a substituent but not a ring member of the 
alicyclic group. 

5 For instance, as discussed above, if the alicyclic group is a furanosyl-type ring, desirable 

cyclic linkages include the following: C-l', C-2'; C-2\ C-3'; C-2\ C-4'; or a C-2', C-5' linkage. 

A cyclic linkage will typically comprise, in addition to the one or more alicyclic group 
ring atoms, 2 to 6 atoms in addition to the alicyclic ring members, more typically 3 or 4 atoms in 
addition to the alicyclic ring member(s). 
10 The alicyclic group atoms that are incorporated into a cyclic linkage are typically carbon 

atoms, but hetero atoms such as nitrogen of the alicyclic group also may be incorporated into a 
cyclic linkage. 

Specifically desirable modified nucleic acids for use oligonucleotides of the invention 
include locked nucleic acids as disclosed in W099/14226 (which include bicyclic and tricyclic 

15 DNA or RNA having a 2-4* or 2'-3' sugar linkages); 2'-deoxy-2 -fluoro ribonucleotides; T-O- 
methyl ribonucleotides; 2*-0-methoxyethyl ribonucleotides; peptide nucleic acids; 5-propynyl 
pyrimidine ribonucleotides; 7-deazapurine ribonucleotides; 2,6-diaminopurine ribonucleotides; 
and 2-thio-pyrimidine ribonucleotides. 

LNA units as disclosed in WO 99/14226 are in general particularly desirable modified 

20 nucleic acids for incorporation into an oligonucleotide of the invention. Additionally, the 

nucleic acids may be modified at either the 3' and/or 5' end by any type of modification known 
in the art. For example, either or both ends may be capped with a protecting group, attached to a 
flexible linking group, attached to a reactive group to aid in attachment to the substrate surface, 
etc. Desirable LNA units also are disclosed in WO 0056746, WO 0056748, and WO 0066604. 

25 Desirable syntheses of pyrene-LNA monomers is shown in the following Schemes 1 and 

2. hi the below Schemes 1 and 2, the compound reference numerals are also referred to in the 
examples below. 
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e 

2,4,5-trimethylphenyl 


Scheme 2 

A wide variety of modified nucleic acids may be employed, including those that have 2- 
modification of hydroxyl, 2'-0-methyl, 2'-fluoro, 2'-trifluoromethyl, 2'-0-(2-methoxyethyl)5 2'- 
0-aminopropyl, 2'-0-dimethylamino-oxyethyl, 2'-0-fluoroethyl or 2'-0-propenyl. The nucleic 
acid may further include a 3' modification, desirably where the 2*- and 3 -position of the ribose 
group is linked. The nucleic acid also may contain a modification at the 4'-position, desirably 
where the T- and 4*-positions of the ribose group are linked such as by a 2-4' link of -CH2-S-, - 
CH2-NH-, or -CH2-NMe- bridge. 

The nucleotide also may have a variety of configurations such as a-D-ribo, P-D-xylo, or 
a-L-xylo configuration. 

The intemucleoside linkages of the units of oligos of the invention may be natural 
phosphorodiester linkages, or other linkages such as -0-P(0)2-0-, -0-P(0,S)-0-, -0-P(S)2-0-, - 
NR".P(0)2-0-, -0-P(0,NR")-0-, -0-P0(R")-0-, -0-P0(CH3)-0-, and -0-PO(NHR^)-0-, where 
R" is selected fi-om hydrogen and CM-alkyl, and R" is selected from Ci.6-alkyl and phenyl. 

A further desirable group of modified nucleic acids for incorporation into oligomers of 
the invention include those of the following formula: 
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wherein X is -0-; B is a modified base as discussed above e.g. an optionally substituted 
carbocyclic aryl such as optionally substituted pyrene or optionally substituted 
5 pyrenylmethylglycerol, or an optionally substituted heteroalicylic or optionally substituted 
heteroaromatic such as optionally substituted pyridyloxazole. Other desirable universal bases 
include, pyrrole, diazole or triazole moieties, all of which may be optionally substituted. R^* is 
hydrogen. 

P designates the radical position for an intemucleoside linkage to a succeeding monomer, 

10 or a 5'-terminal group, such intemucleoside linkage or 5'-terminal group optionally including the 
substituent R^, R^ being hydrogen or included in an intemucleoside linkage. R"^* is a group P* 
which designates an intemucleoside linkage to a preceding monomer, or a 3'-terminal group. 
One or two pairs of non-geminal substituents selected from the present substituents of R^, R^*, 
, may designate a biradical consisting of 1-4 groups/atoms selected from -C(R^R^)-, - 

15 C(R>C(R')-, -C(R')=N-, -0-, -S-, -SO2-, -N(R')-, and >C=Z. Z is selected from -0-, and - 
N(R^)-, and R^ and R^ each is independently selected from hydrogen, optionally substituted Cue- 
alkyl, optionally substituted C2-6-alkenyl, hydroxy, Ci.6-alkoxy, C2.6-alkenyloxy, carboxy, C1.6- 
alkoxycarbonyl, Ci.6-alkylcarbonyl, formyl, amino, mono- and di(Ci.6-alkyl)amino, carbamoyl, 
mono- and di(Ci_6-alkyl)-amino-carbonyl, amino-Ci-6-alkyl-aminocarbonyl, mono- and di(Ci-6- 

20 alkyl)amino-Ci.6-alkyl-aminocarbonyl, Ci-6-alkyl-carbonylamino, carbamido, Cj^-alkanoyloxy, 
sulphono, Ci^-alkylsulphonyloxy, nitro, azido, sulphanyl, Ci-6-alkylthio, halogen, 
photochemically active groups, thermochemically active groups, chelating groups, reporter 
groups, and ligands, the possible pair of non-geminal substituents thereby forming a monocyclic 
entity together with (i) the atoms to which the non-geminal substituents are bound and (ii) any 

25 intervening atoms; and each of the substituents R^, R^*, R^, R"** which are present and not 

involved in the possible biradical is independently selected from hydrogen, optionally substituted 
Ci^-alkyl, optionally substituted Ca-e-alkenyl, hydroxy, Ci-e-alkoxy, C2-6-alkenyloxy, carboxy, 
Ci^-alkoxycarbonyl, Ci_6-alkylcarbonyl, formyl, amino, mono- and di(Ci_6-alkyl)amino, 
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carbamoyl, mono- and di(Ci-6-alkyl)-amino-carbonyl5 amino-Ci.6-alkyl-aminocarbonyl, mono- 
and di(Ci-6-alkyl)amino-Ci.6-alkyl-aminocarbonyl, Ci-6-alkyl-carbonylamino, carbamido, Ci,6- 
alkanoyloxy, sulphono, Ci.6-alkylsulphonyloxy, nitre, azido, sulphanyl, Ci.6-alkylthio, halogen, 
photochemically active groups, thermochemically active groups, chelating groups, reporter 
5 groups, and ligands; and basic salts and acid addition salts thereof. 

Modified nucleobases and nucleosidic bases may comprise a cyclic unit (e.g. a 
carbocyclic unit such as pyrenyl) that is joined to a nucleic unit, such as a l-position of 
fiirasonyl ring through a linker, such as a straight of branched chain alkylene or alkenylene 
group. Alkylene groups suitably having from 1 (i.e. -CH2-) to about 12 carbon atoms, more 
10 typically 1 to about 8 carbon atoms, still more typically 1 to about 6 carbon atoms. Alkenylene 
groups suitably have one, two or three carbon-carbon double bounds and from 2 to 12 carbon 
atoms, more typically 2 to 8 carbon atoms, still more typically 2 to 6 carbon atoms. 

Example 24: Exemplary Nucleic Acid Monomers and Oligomers 

1 5 Desirable LNA units include those that contain a furanosyl-type ring and one or more of 

the following linkages: C-l', C-2'; C-2', C-3'; C-2\ C-4'; or a C-2\ C-5' linkage. A C-2', is 
particularly desirable. In another aspect of the invention, desirable LNA units are compounds 
having a substituent on the 2 '-position of the central sugar moiety (e.g., ribose or xylose), or 
derivatives thereof, which favors the C3'-endo conformation, commonly referred to as the North 

20 (or simply N for short) conformation. Desirable LNA In various embodiments, the 

oligonucleotide has at least one LNA unit with a modified base as disclosed herein. Suitable 
oligonucleotides also may contain natural DNA or RNA units (e.g., nucleotides) with natural 
bases, as well as LNA units that contain natural bases. Furthermore, the oligonucleotides of the 
invention also may contain modified DNA or RNA, such as 2'-0-methyl RNA, with natural or 

25 modified nucleobases (e.g., pyrene). Desirable oligonucleotides contain at least one of and 

desirably both of 1) one or more DNA or RNA units (e.g., nucleotides) with natural bases, and 2) 
one or more LNA units with natural bases, in addition to LNA units with a modified base. In 
other embodiments, the nucleic acid does not contain a modified base. 

Oligonucleotides of the invention desirably contain at least 50 percent or more, more 

30 desirably 55, 60, 65, or 70 percent or more of non-modified or natural DNA or RNA units (e.g., 
nucleotides) or units other than LNA units based on the total number of units or residues of the 
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oligo. A non-modified nucleic acid as referred to herein means that the nucleic acid upon 
incorporation into a 10-mer oligomer will not increase the Tm of the oligomer in excess of l^C or 
2^C. More desirably, the non-modified nucleic acid unit (e.g., nucleotide) is a substantially or 
completely "natural" nucleic acid, i.e. containing a non-modified base of uracil, cytosine, 5- 
5 methyl-cytosine, thymine, adenine or guanine and a non-modified pentose sugar unit of P-D- 
ribose (in the case of RNA) or p-D-2-deoxyribose (in the case of DNA). 

Oligonucleotides of the invention suitably may contain only a single modified (i.e. LNA) 
nucleic acid unit, but desirably an oligonucleotide will contain 2, 3, 4 or 5 or more modified 
nucleic acid units. Typically desirable is where an oligonucleotide contains from about 5 to 

10 about 40 or 45 percent modified (LNA) nucleic acid units, based on total units of the oligo, more 
desirably where the oligonucleotide contains from about 5 or 10 percent to about 20, 25, 30 or 35 
percent modified nucleic acid units, based on total units of the oligo. 

Typical oligonucleotides that contain one or more LNA units with a modified base as 
disclosed herein suitably contain firom 3 or 4 to about 200 nucleic acid repeat units, with at least 

15 one unit being an LNA unit with a modified base, more typically from about 3 or 4 to about 5, 6, 
7, 8, 9, 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 1 10, 120, 130, 140 or 150 nucleic acid units, 
with 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 LNA units with a modified base being present. 

As discussed above, particularly desirable oligonucleotides contain a non- modified DNA 
or RNA unit at the 3' terminus and a modified DNA or RNA unit at one position upstream from 

20 (generally referred to hereing as the -1 or penultimate position) the 3' terminal non-modified 
nucleic acid unit. In some embodiments, the modified base is at the 3' terminal position of a 
nucleic acid primer, such as a primer for the detection of a single nucleotide polymorphism. 
Other particularly desirable nucleic acids have an LNA unit with or without a modified base in 
the 5' and/or 3' terminal position. 

25 Also desirable are oligonucleotides that do not have an extended stretches of modified 

DNA or RNA units, e.g. greater than about 4, 5 or 6 consecutive modified DNA 
or RNA units. That is, desirably one or more non-modified DNA or RNA will be present after a 
consecutive stretch of about 3, 4 or 5 modified nucleic acids. 

Generally desirable are oligonucleotides that contain a mixture of LNA units that have 

30 non-modified or natural nucleobases (i.e., adenine, guanine, cytosine, 5 -methyl-cytosine, uracil, 
or thymine) and LNA units that have modified bases as disclosed herein. 


180 


Particularly desirable oligonucleotides of the invention include those where an LNA unit 
with a modified base is interposed between two LNA units each having non-modified or natural 
bases (adenine, guanine, cytosine, 5-methyl-cytosine, uracil, or thymine. The LNA "flanking" 
units with natural base moieties may be directly adjacent to the LNA with modified base moiety, 
or desirably is within 2, 3, 4 or 5 nucleic acid units of the LNA unit with modified base. Nucleic 
acid units that may be spaced between an LNA unit with a modified base and an LNA unit with 
natural nucleobasis suitably are DNA and/or RNA and/or alkyl-modified RNA/DNA units, 
typically with natural base moieties, although the DNA and or RNA units also may contain 
modified base moieties. 

The oligonucleotides of the present invention are comprised of at least about one 
universal base. Oligonucleotides of the present can also be comprised, for exmple, of between 
about one to six 2'-0me-RNA unit, at least about two LNA units and at least about one LNA 
pyrene unit. 

Example 25: Exemplary Target Nucleic Acids 

In the practice of the present invention, target genes may be suitably single-stranded or 
double-stranded DNA or RNA; however, single-stranded DNA or RNA targets are desirable. It 
is understood that the target to which the nucleic acids of the invention are directed include 
allelic forms of the targeted gene and the corresponding mRNAs including splice variants. There 
is substantial guidance in the literature for selecting particular sequences for nucleic acids with 
LNA or other high affinity nucleotides given a knowledge of the sequence of the target 
polynucleotide, e.g., Peyman and Ulmann, Chemical Reviews, 90:543-584, 1990; Crooke,^««. 
Rev. Pharmacol Toxicol, 32:329-376 (1992); and Zamecnik and Stephenson, Proc, Natl Acad, 
Set, 75:280-284 (1974). Desirable mRNA targets include the 5' cap site, tRNA primer binding 
site, the initiation codon site, the mRNA donor splice site, and the mRNA acceptor splice site, 
e.g., Goodchild et al., U.S. Patent 4,806,463. 

Example 26: Exemplary Applications of Present Methods 

The chimeric oligos of the present invention are highly suitable for a variety of diagnostic 
purposes such as for the isolation, purification, amplification, detection, identification, 
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quantification, or capture of nucleic acids such as DNA, mRNA or non-protein coding cellular 
RNAs, such as tRNA, rRNA, snRNA and scRNA, or synthetic nucleic acids, in vivo or in vitro. 
The oligomer can comprise a photochemically active group, a thermochemically active 
group, a chelating group, a reporter group, or a ligand that facilitates the direct or indirect 
5 detection of the oligomer or the immobilization of the oligomer onto a solid support. Such group 
are typically attached to the oligo when it is intended as a probe for in situ hybridization, in 
Southern hybridization, Dot blot hybridization, reverse Dot blot hybridization, or in Northern 
hybridization. 

When the photochemically active group, the thermochemically active group, the 
10 chelating group, the reporter group, or the ligand includes a spacer (K), the spacer may suitably 

comprise a chemically cleavable group. 

An additional object of the present invention is to provide oligonucleotides which 

combines an increased ability to discriminate between complementary and mismatched targets 

with the ability to act as substrates for nucleic acid active enzymes such as for example DNA and 
1 5 RNA polymerases, ligases, phosphatases. Such oligonucleotides may be used for instance as 

primers for sequencing nucleic acids and as primers in any of the several well known 

amplification reactions, such as the PGR reaction. 

Introduction of LNA monomers with natural bases into either DNA, RNA, or pure LNA 

oligonucleotides can result in extremely high thermal stability of duplexes with complementary 
20 DNA or RNA, while at the same time obeying the Watson-Crick base-pairing rules. In general, 

the thermal stability of heteroduplexes is increased 3-8^C per LNA monomer in the duplex. 

Oligonucleotides containing LNA can be designed to be substrates for polymerases (e.g.. Tag 

polymerase), and PCR based on LNA primers is more discriminatory towards single base 

mutations in the template DNA compared to normal DNA-primers (e.g., allele specific PCR). 
25 Furthermore, very short LNA oligos (e.g. 5-mers or 8-mers) which have high Tm's when 

compared to similar DNA oligos can be used as highly specific catching probes with outstanding 

discriminatory power towards single base mutations (e.g., SNP detection). 

LNA oligonucleotides are capable of hybridizing with double-stranded DNA target 

molecules as well as RNA secondary structures by strand invasion as well as of specifically 
30 blocking a wide selection of enzymatic reactions such as, digestion of double-stranded DNA by 
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restriction endonucleases; and digestion of DNA and RNA with deoxyribonucleases and 

ribonucleases, respectively. 

In a further aspect, oHgonucleotides of the invention may be used to construct new 

affinity pairs with exhibit enhanced specificity towards each other. The affinity constants can 
5 easily be adjusted over a wide range and a vast number of affinity pairs can be designed and 

synthesized. One part of the affinity pair can be attached to the molecule of interest (e.g. 

proteins, amplicons, enzymes, polysaccharides, antibodies, haptens, peptides, etc.) by standard 

methods, while the other part of the affinity pair can be attached to e.g. a solid support such as 

beads, membranes, micro-titer plates, sticks, tubes, etc. The solid support may be chosen fi-om a 
10 wide range of polymer materials such as for instance polypropylene, polystyrene, polycarbonate 

or polyethylene. The affinity pairs may be used in selective isolation, purification, capture and 

detection of a diversity of the target molecules. 

Oligonucleotides of the invention also may be employed as probes in the purification, 

isolation and detection of for instance pathogenic organisms such as viral, bacteria and fimgi etc. 
15 Oligonucleotides of the invention also may be used as generic tools for the purification, isolation, 

amplification and detection of nucleic acids fi'om groups of related species such as for instance 

rRNA fi-om gram-positive or gram negative bacteria, fiingi, mammalian cells etc. 

Oligonucleotides of the invention also may be employed as an aptamer in molecular 

diagnostics, e.g. in RNA mediated catalytic processes, in specific binding of antibiotics, drugs, 
20 amino acids, peptides, structural proteins, protein receptors, protein enzymes, saccharides, 

polysaccharides, biological cofactors, nucleic acids, or triphosphates or in the separation of 

enantiomers fi'om racemic mixtures by stereospecific binding. 

Oligonucleotides of the invention also may be used for labeling of cells, e.g. in methods 

wherein the label allows the cells to be separated from unlabelled cells. 
25 Oligonucleotides also may be conjugated to a compound selected from proteins, 

amplicons, enzymes, polysaccharides, antibodies, haptens, and peptides. 

Kits are also provided containing one or more oligonucleotides of the invention for the 

isolation, purification, amplification, detection, identification, quantification, or capture of 

natural or synthetic nucleic acids. The kit typically will contain a reaction body, e.g. a slide or 
30 biochip. One or more oligonucleotides of the invention may be suitably immobilized on such a 

reaction body, 
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The invention also provides methods for using kits of the invention for carrying out a 
variety of bioassays, e.g. for diagnostic purposes. Any type of assay wherein one component is 
immobilized may be carried out using the substrate platforms of the invention. Bioassays 
utilizing an immobilized component are well known in the art. Examples of assays utilizing an 
5 immobilized component include for example, immunoassays, analysis of protein-protein 

interactions, analysis of protein-nucleic acid interactions, analysis of nucleic acid-nucleic acid 
interactions, receptor binding assays, enzyme assays, phosphorylation assays, diagnostic assays 
for determination of disease state, genetic profiling for drug compatibility analysis, and SNP 
detection (U.S.P.N. 6,316,198; 6,303,315). 

10 Identification of a nucleic acid sequence capable of binding to a biomolecule of interest 

can be achieved by immobilizing a library of nucleic acids onto the substrate surface so that each 
unique nucleic acid was located at a defined position to form an array. The array would then be 
exposed to the biomolecule under conditions which favored binding of the biomolecule to the 
nucleic acids. Non-specifically binding biomolecules could be washed away using mild to 

1 5 stringent buffer conditions depending on the level of specificity of binding desired. The nucleic 
acid array would then be analyzed to determine which nucleic acid sequences bound to the 
biomolecule. Desirably the biomolecules would carry a fluorescent tag for use in detection of 
the location of the bound nucleic acids. 

Oligonucleotides of the invention can be employed in a wide range of applications, 

20 particularly those applications involving a hybridization reaction. Oligonucleotides also may be 
used in DNA sequencing aiming at improved throughput in large-scale, shotgun genome 
sequencing projects, improved throughput in capillary DNA sequencing (e.g. ABI prism 3700) 
as well as at an improved method for 1) sequencing large, tandemly repeated genomic regions, 2) 
closing gaps in genome sequencing projects and 3) sequencing of GC-rich templates. In DNA 

25 sequencing, oligonucleotide sequencing primers are combined with LNA enhancer elements for 
the read-through of GC-rich and/or tandemly repeated genomic regions, which often present 
many challenges for genome sequencing projects. LNA may increase the specificity of certain 
sequencing primers and thus facilitate selection of a particular version of a repeated sequence 
and possibly also use strand invasion to open up recalcitrant GC rich sequences. 


184 


The incorporation of one or more universal nucleosides into the oligomer makes bonding 
to unknown bases possible and allows the oligonucleotide to match ambiguous or unknown 
nucleic acid sequences. 

As discussed above, oligonucleotides of the invention may be used for therapeutic 
5 applications, e.g. as an antisense, antigene or ribozyme or double stranded nucleic acid 

therapeutic agents. In these therapeutic methods, one or more oligonucleotides of the invention 
is administered as desired to a patient suffering from or susceptible the targeted disease or 
disorder, e.g. a viral infection. 

In an exemplary in vitro method for measuring the ability of a nucleic acid of the 

10 invention to silence a target gene, cells are cultured in standard medium supplemented with 1% 
fetal calf serum as previously described (Lykkesfeld et al. Int. J. Cancer 61 :529-534, 1995). At 
the start of the experiment cells are approximately 40% confluent. The serum containing 
medium is removed and replaced with serum-free medium. Transfection is performed using, 
e.g., Lipofectin (GibcoBRL cat. No 18292-01 1) diluted 40X in medium without serum and 

1 5 combined with the oligo to a concentration of 750 nM oligo, 0.8 ug/ml Lipofectin. Then, the 
medium is removed from the cells and replaced with the medium containing oHgo-Lipofectin 
complex. The cells are incubated at 37^C for 6 hours, rinsed once with medium without serum 
and incubated for a further 18 hours in DME/F12 with 1% FCS at 37°C. Standard methods are 
used to measure the level of mRNA or protein encoded by the target gene to measure the level of 

20 gene silencing. 

It is also contemplated that information on the structures assumed by a target nucleic acid 
may be used in the design of the probes, such that regions that are known or suspected to be 
involved in folding may be chosen as hybridization sites. Such an approach will reduce the 
number of probes that are likely to be needed to distinguish between targets of interest. 

25 There are many methods used to obtain structural information involving nucleic acids, 

including the use of chemicals that are sensitive to the nucleic acid structure, such as 
phenanthroline/copper, EDTA-Fe^"*^, cisplatin, ethylnitrosourea, dimethyl pyrocarbonate, 
hydrazine, dimethyl sulfate, and bisulfite. Enzymatic probing using structure-specific nucleases 
from a variety of sources, such as the Cleavase™ enzymes (Third Wave Technologies, Inc., 

30 Madison, Wis.), Taq DNA polymerase, E. coli DNA polymerase I, and eukaryotic structure- 
specific endonucleases (e.g., human, murine and Xenopus XPG enzymes, yeast RAD2 enzymes), 
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murine FEN-1 endonucleases (Harrington and Lieber, Genes and Develop., 3:1344 [1994]) and 
calf thymus 5* to 3' exonuclease (Murante et al., J. Biol. Chem., 269:1 191 [1994]). In addition, 
enzymes having 3' nuclease activity such as members of the family of DNA repair endonucleases 
(e.g., the RrpI enzyme from Drosophila melanogaster, the yeast RAD 1 /RAD 10 complex and E. 
5 coli Exo III), are also suitable for examining the structures of nucleic acids. 

If analysis of structure as a step in probe selection is to be used for a segment of nucleic 
acid for which no information is available concerning regions likely to form secondary 
structures, the sites of structure-induced modification or cleavage must be identified. It is most 
convenient if the modification or cleavage can be done under partially reactive conditions (i.e., 

10 such that in the population of molecules in a test sample, each individual will receive only one or 
a few cuts or modifications). When the sample is analyzed as a whole, each reactive site should 
be represented, and all the sites may be thus identified. Using a Cleavase Fragment Length 
Polymorphism™ cleavage reaction as an example, when the partial cleavage products of an end 
labeled nucleic acid fragment are resolved by size (e.g., by electrophoresis), the result is a ladder 

1 5 of bands indicating the site of each cleavage, measured fi-om the labeled end. Similar analysis 
can be done for chemical modifications that block DNA synthesis; extension of a primer on 
molecules that have been partially modified will yield a nested set of termination products. 
Determining the sites of cleavage/modification may be done with some degree of accuracy by 
comparing the products to size markers (e.g., commercially available fragments of DNA for size 

20 comparison) but a more accurate measure is to create a DNA sequencing ladder for the same 
segment of nucleic acid to resolve alongside the test sample. This allows rapid identification of 
the precise site of cleavage or modification. 

Example 27: General Reaction Conditions for Synthesis of Some Compounds of the 
25 Invention 

Reactions were conducted under an atmosphere of nitrogen when anhydrous solvents 
were used. All reactions were monitored by thin-layer chromatography (TLC) using EM reagent 
plates with fluorescence indicator (SiO2-60, F-254). The compounds were visualized under UV 
light and by spraying with a mixture of 5% aqueous sulfiiric acid and ethanol followed by 
30 heating. Silica gel 60 (particle size 0.040-0.063 mm, Merck) was used for flash column 

chromatography. NMR spectra were recorded at 300 MHz for *H NMR, 75.5 MHz for ^^C NMR 
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and 121 .5 MHz for ^^P NMR on a Varian Unity 300 spectrometer. 5- Values are in ppm relative 
to tetramethyl silane as internal standard (^H and ^"^C NMR) and relative to 85% H3PO4 as 
external standard (^^P NMR). Coupling constants are given in Hertz. The assignments, when 
given, are tentative, and the assignments of methylene protons, when given, may be 
5 interchanged. Bicyclic compounds are named according to the Von Bayer nomenclature. Fast 
atom bombardment mass spectra (FAB-MS) were recorded in positive ion mode on a Kratos 
MS50TC spectrometer. The composition of the oligonucleotides were verified by MALDI-MS 
on a Micromass Tof Spec E mass spectrometer using a matrix of diammonium citrate and 2,6- 
dihydroxyacetophenone. 

10 

Example 28: Synthesis of Ijl-O-Isopropylidene-S-O-methanesulfonyM-C- 
methanesulfonyloxymethyl-3-0-(p-methoxybenzyl)-a-D-nbofuranose [Compound 2 in 
Scheme 1 above] 

Mesyl chloride (8.6 g, 7.5 mmol) was dropwise added to a stirred solution of 4-C- 
15 hydroxymethyl-1 ,2-0-isopropylidene-3-0-p-methoxybenzyl-a-D-ribofuranose [R. Yamaguchi, 
T. Imanishi, S. Kohgo, H. Horie and H. Ohrui, Biosci. BiotechnoL Biochem., 1999, 63, 736] (1, 
10.0 g, 29.4 mmol) in anhydrous pyridine (30 cm^) and the reaction mixture was stirred 
overnight at room temperature. The mixture was evaporated to dryness under reduced pressure to 
give a residue which was co-evaporated with toluene (2 x 25 cm"^), dissolved in CH2CI2 (200 
20 cm^) and washed successively with saturated aqueous NaHCOs (2 x 100 cm^) and brine (50 
cm^). The organic phase was dried (Na2S04), filtered and evaporated to dryness under reduced 
pressure. The colorless viscous oil obtained was purified by column chromatography [0.5-1% 
(v/v) MeOH in CH2CI2 as eluent], followed by crystallization from MeOH to give furanose 2 as 
a white soHd material (13.6 g, 93%); i?/0.57 (CH2Cl2/MeOH 95:5, v/v); Sh (CDCI3) 7.30 (2 H, d, 
25 JSJl 6.90 (2 H, d, J8.5), 5.78 (1 H, d, / 3.7), 4.86 (1 H, d, J 12.0), 4,70 (1 H, d, J 1 1.4), 4.62 (1 
H, dd,y5.0 and 3.8), 4.50(1 H, d, J 11.1), 4.39(1 H,d, 712.3), 4.31 (1 H, d, 7 11.0), 4.17(1 H, 
d,J5.1), 4.11 (1 H, d,yil.0),3.81 (3 H, s), 3.07 (3 H, s), 2.99 (3 H, s), 1.68 (3 H, s), 1.34(3 H, 
s);5c(CDCl3) 159.8, 129.9, 128.8, 114.1, 114.0, 104.5,83.2, 78.0, 77.9, 72.6, 69.6, 68.8, 55.4, 
38.1,37.5,26.3, 25.7. 

30 
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Example 29: Synthesis of Methyl 5-0-inethanesulfonyl-4-C-inethanesulfonyloxyinethyl-3- 
0-(p-inethoxybenzyl)-D-ribofuranoside [Compound 3 in Scheme 1 above] 

A suspension of furanoside 2 (13.5 g, 27.2 mmol) in a mixture of H2O (45 cm^) and 15% 
HCl in MeOH (450 cm^ w/w) was stirred at room temperature for 72 h. The mixture was 
5 carefully neutralized by addition of saturated aqueous NaHCOs (1 00 cm^) followed by NaHCOs 
(s) whereupon the mixture was evaporated to dryness under reduced pressure. H2O (lOOcm^) was 
added, and extraction was performed with EtOAc (3 x 100 cm^). The combined organic phase 
was washed with brine (1 00 cm^), dried (Na2S04), filtered and then evaporated to dryness under 
reduced pressure. The residue was coevaporated with toluene (2 x 25 cm^) and purified by 

10 column chromatography [1-2% (v/v) MeOH in CH2CI2] to give furanoside 3 as an anomeric 
mixture (clear oil, 1 1 .0 g, 86%, ratio between anomers ca. 6:1); Rf 039, 0.33 (CH2Cl2/MeOH 
95:5, v/v); 4 (CDCI3, major anomer only) 7.28 (2 H, d, J 8.4), 6.91 (2 H, d, J 8.9), 4.87 (1 H, s), 
4.62(1 H,d,y 11.4), 4.53 (1 H, d,y 11.2), 4.41 (2 H, s),4.31 (1 H, d, 79.8), 4.24 (1 H, d,J4.6), 
4.06 (1 H, d, J 10.0), 3.98 (1 H, br s), 3.81 (3 H, s), 3.33 (3 H, s), 3.06 (3 H, s), 3.03 (3 H,s); So 

15 (CDCI3, major anomer only) 160.0, 130.1, 128.5, 1 14.3, 107.8, 81.7, 81.2, 73.8, 73.6, 69.7, 69.6, 
55.5, 55.4, 37.5, 37.4. 

Example 30: Synthesis of (lR,3RS,4R,7S)-l-MethanesuIfonyloxymethyl-3-methoxy-7-(p- 
niethoxybenzyloxy)-2,5-dioxabicyclo[2.2.1]heptane [Compound 4 in Scheme 1 above] 

20 To a stirred solution of the anomeric mixture of Compound 3 (10.9 g, 23.2 mmol) in 

anhydrous DMF (50 cm^) at 0 ^C was during 10 min added sodium hydride (2.28 g of a 60% 
suspension in mineral oil (w/w), 95.2 mmol) and the mixture was stirred for 12 h at room 
temperature. Ice-cold H2O (200 cm^) was slowly added and extraction was performed using 
EtOAc (3 X 200 cm ). The combined organic phase was washed successively with saturated 

25 aqueous NaHCOs (2 x 100 cm^) and brine (50 cm^), dried (Na2S04), filtered and evaporated to 
dryness under reduced pressure. The residue was purified by column chromatography [0.5-1% 
(v/v) MeOH in CH2CI2 ] to give first the major isomer (6.42 g, 74%) and then [1.5% (v/v) 
MeOH in CH2CI2 ] the minor isomer (1.13 g, 13%), both as clear oils; RfO.56, 0.45 
(CH2Cl2/MeOH 95:5, v/v); Sh (CDCI3 , major isomer) 7. 1 6 (2 H, d, J 8.8), 6.74 (2 H, d, J 8.4), 

30 4.65 (1 H, s), 4.42-4.32 (4 H, m), 3.95-3.94 (2 H, m), 3.84 (1 H, d, J7.4), 3.66 (3 H, s), 3.54 (1 
H, d,y7.4), 3.21 (3 H, s), 2.90 (3 H, s); (CDCI3, major isomer) 159.6, 129.5,129.3, 114.0, 
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105.3, 83.2, 78.6, 77.2, 72.1, 71.8, 66.3, 55.6, 55.4, 37.8; Sh (CDCI3 , minor isomer) 7.27 (2 H, d, 
J8.9), 6.89 (2 H, d, 78.6), 4.99 (1 H, s), 4.63-4.39 (4 H, m), 4.19 (1 H, s), 4.10-3.94 (2 H, m), 
3.91 (1 H, s), 3.81 (3 H, s), 3.47 (3 H, s), 3.05 (3 H, s); 4 (CDCI3, minor isomer) 159.7, 129.6, 
129.5, 1 14.1, 104.4, 86.4, 79.3, 77.1, 72.3, 71.9, 66.2, 56.4, 55.4, 37.7. 

5 

Example 31: Synthesis of (lR,4R,7S)-l-AcetoxymethyI-3-methoxy-7-(p-methoxybenzyloxy)- 
2,5-dioxabicyclo[2.2.1]heptane [Compound 5 in Scheme 1] 

To a stirred solution of furanoside 4 (major isomer, 6.36 g, 17.0 nmiol) in dioxane (25 
cm^) was added 18-crown-6 (9.0 g, 34.1 mmol) and KOAc (8.4 g, 85.6 mmol). The stirred 

10 mixture was heated under refluxed for 12 h and subsequently evaporated to dryness under 

reduced pressure. The residue was dissolved in CH2CI2 (100 cm^) and washing was performed, 
successively, with saturated aqueous NaHCOa (2 x50 cm^) and brine (50 cm''). The separated 
organic phase was dried (Na2S04), filtered and evaporated to dryness under reduced pressure. 
The residue was purified by column chromatography [1% (v/v) MeOH in CH2CI2] to give 

1 5 fiiranoside 5 as a white solid material (one anomer, 5 .23 g, 9 1 %); Rf 0.63 (CH2Cl2/MeOH 95 :5, 
v/v); 5h (CDCI3) 7.27-7.24 (2 H, m), 6.90-6.87 (2 H, m), 4.79 (1 H, s), 4.61 (1 H, d, J 1 1 .0), 4.49 
(2 H, m), 4.28 (1 H, d, J 1 1 .0), 4.04 (3 H, m), 3.80 (3 H, s), 3.68 (1 H, m), 3.36 (3 H, s), 2.06 (3 
H, s); 8t (CDCI3) 170.7, 159.5, 129.5, 129.4, 1 13.9, 105.1, 83.3, 78.9, 77.2, 72.0, 71.9, 61.0, 
55.4, 55.3, 20.8. 

20 

Example 32: Synthesis of (lS,4R,7S)-l-Hydroxymethyl-3-methoxy-7-(p- 
methoxybenzyIoxy)-2,5-dioxabicycIo|2.2.1]heptane [Compound 6 in Scheme 1] 

A solution of furanoside 5 (one anomer, 5.16 g, 15.3 mmol) in saturated methanolic 
ammonia (200 cm'') was stirred at room temperature for 48 h. The reaction mixture was 

25 evaporated to dryness under reduced pressure, coevaporated with toluene (2 x 50 cm^), and the 
residue purified by colunm chromatography [2-3% (v/v) MeOH in CH2CI2 ] to give furanoside 6 
as a white solid material (one anomer, 3.98 g, 88%); Rf 0.43 (CH2Cl2/MeOH 95:5, v/v); 5h 
(CDCI3) 7.27 (2 H, d,y8.6), 6.88 (2 H, d,78.9), 4.79 (1 H, s), 4.59 (1 H, d, J 1 1.3), 4.53 (1 H, d, 
J 1 1.4), 4.09 (2 H, s), 3.97 (1 H, d, J7.5), 3.86 (2 H, br s), 3.80 (3 H, s), 3.75-3.62 (2 H, m), 3.37 

30 (3 H, s); 5c (CDCI3) 159.4, 129.7, 129.3, 1 13.9, 105.2, 85.6, 78.3, 77.4, 71.9, 71.8, 58.8, 55.5, 
55.3. 
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Example 33: (lS,4R,7S)-3-Methoxy-7-(p-methoxybenzyIoxy)-l-(p- 
methoxybenzyloxymethyl)-2,5-dioxabicyclo[2.2.1]heptane [Compound 7 in Scheme 1] 

To a stirred solution of furanoside 6 (one anomer, 3.94 g, 13.3 mmol) in anhydrous DMF 
5 (50 cm"^) at 0 °C was added a suspension of NaH [60% in mineral oil (w/w), 1 .46 g, 60.8 mmol] 
followed by dropwise addition of />-methoxybenzyl chloride (2.74 g, 1 7.5 mmol). The mixture 
was allowed to warm to room temperature and stirring was continued for another 4 h whereupon 
ice-cold H2O (50 cm*^) was dropwise added. The mixture was extracted with CH2CI2 (3 x 100 
cm"^) and the combined organic phase was washed with brine (100 cm*^), dried (Na2S04), filtered, 
10 evaporated to dryness under reduced pressure and coevaporated with toluene (3 x 50 cm ). The 
residue (4.71 g) tentatively assigned as a mixture of 7 and aldehyde 11 was used in the 
preparation of 11 (see below) without further purification. 

Example 34: 4-C-MethanesuIfonyIoxymethyI-3,5-di-0-(p-methoxybenzyI)-l,2-0- 
15 isopropylidene-a-D-ribofuranose [Compound 9 in Scheme 1] 

4-C-Hydroxymethyl-3,5-di-0-(p-methoxybenzyl)-l,2-(9-isopropylidene-a-D- 
ribofuranose [R. Yamaguchi, T. Imanishi, S. Kohgo, H. Hone and H. Ohrui, Biosci, Biotechnol 
Biochem., 1999, 63, 736] (8, 3.2 g, 6.95 mmol) was mesylated using MsCl (2.00 g, 17.5 mmol) 
and pyridine (10 cm ) following the procedure described for 2. After work-up, the colorless 

20 viscous oil was purified by column chromatography [1% (v/v) MeOH in CH2CI2] to give 

derivative 9 in 89% yield (3.17 g) as a clear oil; /J/0.45 (CH2Cl2/MeOH 98:2, v/v); 6h (CDCI3) 
7.22 (2 H, d, J8.9), 7.18 (2 H, d, J8.7), 6.86 (4 H, d, J 8.3), 5.76 (1 H, d, J3.8), 4.83 (1 H, d, J 
12.0), 4.64 (1 H, d, J 1 1.6), 4.59 (1 H, m), 4.49-4.35 (4 H, m), 4.24 (1 H, d, J5.3), 3.80 (6 H, s), 
3.56(1 H, d,y 10.5), 3.45(1 H, d, 7 10.5), 3.06 (3 H, s), 1.67 (3 H, s), 1.33(3 H, s); 5c (CDCI3) 

25 159.6, 159.4, 129.9, 129.8, 129.7, 129.5, 129.4, 129.3, 1 14.0, 1 13.9, 1 13.8, 1 13.7, 1 13.6, 104.5, 
84.9, 78.6, 78.1, 73.4, 72.4, 71.0, 69.9, 55.3, 38.0, 26.4, 25.9. 

Example 35: Methyl 4-C-methanesulfonyloxymethyl-3,5-di-0-(p-methoxybenzyl)-D- 
ribofuranose [Compound 10 in Scheme 1] 

30 Methanolysis of furanoside 9 (3.1 g, 5.76 mmol) was performed using a mixture of a 

solution of 15% HCl in MeOH (w/w, 120 cm^) and H2O (12 cm^) following the procedure 


described for the synthesis of 3. After work-up, the crude product was purified by column 
chromatography [0.5-1% (v/v) MeOH in CH2CI2] to give the major anomer of 10 (1 .71 g, 58%) 
and [1-1.5% (v/v) MeOH in CH2CI2] the minor anomer of 10 (0.47 g, 16%), both as clear oils; Rf 
0.31, 0.24 (CH2Cl2/MeOH 98:2, v/v); 4 (major anomer, CDCI3) 159.8, 159.5, 129.9, 129.8, 
5 129.6, 129.5, 129.0, 114.2, 114.1, 114.0, 113.9, 107.9, 84.7, 79.9, 74.2, 73.5, 73.5, 70.2, 64.4, 
55.6, 55.4, 37.4. 

Example 36: Alternative preparation of Compound 7 in Scheme 1 

Ring closure of furanoside 10 (major anomer, 1.68 g, 3.28 mmol) was achieved using 
10 NaH (60% suspension in mineral oil (w/w), 0.32 g, 13.1 mmol) in anhydrous DMF (10 cm^) 
following the procedure described for the synthesis of 4 to give a crude product tentatively 
assigned as a mixture of furanoside 7 and aldehyde 11 (see below) (1.13 g). 

Example 37: (2R,3S54S)-4-Hydroxy-3-(p-methoxybenzyloxy)-4-(p- 
15 methoxybenzyloxymethyl)-tetrahydrofuran-2-carbaldehyde [Compound 11 in Scheme 1] 

A solution of crude furanoside 7 (as a mixture with 11 as prepared as described above, 
5.80 g) in 80% glacial acetic acid (100 cm^) was stirred at 50 ^C for 4 h. The solvent was 
distilled off under reduced pressure and the residue was successively coevaporated with absolute 
ethanol (3 x 25 cm^) and toluene (2 x 25 cm*^) and purified by column chromatography [4-5% 
20 (v/v) MeOH in CH2CI2 ] to give aldehyde 11 as a coloriess oil (4.60 g); /?/0.37 (CHaCh/MeOH 
95:5, v/v); <5h (CDCI3) 9.64 (1 H, br s), 7.27-7.17 (4 H, m), 6.87>6.84 (4 H, m), 4.59 (1 H, d, / 
11.6), 4,51-4.41 (2 H, m), 4.35 (1 H, s), 3.92-3.90 (2 H, m), 3.79 (6 H, s), 3.77-3.68 (3 H, m), 
3.55 (2 H,brs); 4 (CDCI3) 203.6, 159.5, 159.4, 129.7, 129.6, 129.5, 129.2, 114.0, 113.9, 113.8, 
87.3, 86.7, 81.0, 75.1, 73.4, 71.6, 67.6, 55.3. 

25 

Example 38: General procedure for the reaction of aryl magnesium bromides with 
aldehyde 11 to give Compounds 12a-e in Scheme 2 

A solution of aldehyde 11 (Scheme 2) in anhydrous THF (10 cm^) was added dropwise 
during 5 min to a stirred solution of the aryl magnesium bromide dissolved in anhydrous THF at 
30 0 °C. The mixture was allowed to heat to room temperature and stirred for 12 h. The mixture was 
evaporated to dryness under reduced pressure and the residue diluted with CH2CI2 and washed 
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several times with saturated aqueous NH4CI. The organic phase was dried (Na2S04), filtered, and 
evaporated to dryness under reduced pressure. Column chromatography of the crude product 
obtained afforded the compounds 12a-e as shown in Scheme 2. 


5 Example 38a: Synthesis of (2S^S,4S)-4-Hydroxy-2-[(R)-hydroxy(phenyl)methyI]-4-(p- 
methoxybenzyloxy)-3-(p-methoxybenzyloxymethyl) tetrahydrofuran [Compound 12a of 
Scheme 2] 

Grignard reaction of phenylmagnesium bromide (1.0 M solution in THF, 14.2 cm', 14.2 
mmol) with aldehyde 11 (515 mg, 1.28 mmol) afforded 12a as shown in Scheme 2. The crude 

10 product was purified by column chromatography [4% (v/v) MeOH in CH2CI2] to give 

tetrahydrofuran 12a (540 mg, 88%) as a colorless oil; Rf034 (CHzCb/MeOH 95:5, v/v); Sh 
(CDCI3) 7.40-7.19 (7 H, m), 6.91-6.73 (6 H, m), 4.73 (1 H, d, J 6.4% 4.48 (2 H, s), 4.08 (2 H, s), 
3.88 (1 H, d, y 9.4), 3.79 (1 H, m), 3.78 (3 H, s), 3.76 (3 H, s), 3.75-3.69 (2 H, m), 3.50 (1 H, d, J 
9.4), 3.45 (1 H, s), 3.42 (1 H, br s), 3.26 (1 H, br s); 4 (CDCI3) 159.5, 159.3, 140.7, 129.7, 129.6, 

15 129.5, 129.2, 128.5, 128.0, 127.3, 113.9, 113.8, 113.7, 89.4, 84.6, 81.8, 75.3, 74.7, 73.5, 71.6, 
69.3, 55.3; m/z (FAB) 503 [M+Na]"", 479 [M-H]"", 461 [M-H-HiO]^. 


Example 38b: Synthesis of (2S^S,4S)-4-Hydroxy-2-[(R)-hydroxy(4-nuoro-3- 
methyIphenyI)-methyl]-4-(p-methoxybenzyloxy)-3-(p- 
20 methoxybenzyloxymethyl)tetrahydrofuran [Compound 12b of Scheme 2] 

Grignard reaction of 4-fluoro-3-methylphenylmagnesium bromide (1.0 M solution in 
THF, 15.0 cm', 15.0 mmol) with aldehyde 11 (603 mg, 1.5 mmol) afforded 12b as shown in 
Scheme 2. The crude product was purified by column chromatography [4-5% (v/v) MeOH in 
CH2CI2] to give tetrahydrofuran 12b (61 1 mg, 85%) as a colorless oil; R/OM (CHzCh/MeOH 

25 95:5, v/v); Sh (CDCI3) 7.24-7.12 (5 H, m), 6.98-6.84 (5 H, m), 6.77 (1 H, d, y 8.5), 4.65 (1 H, dd, 
72.8 and 6.4), 4.49 (2 H, s), 4.15 (2 H, s), 4.01 (1 H, dd, 72.3 and 6.5), 3.87 (1 H, d, 79.3), 3.79 
(3H, s), 3.78 (3 H, s), 3.76-3.68 (2 H, m), 3.52 (1 H, s), 3.47 (1 H, d, 7 10.3), 3.42 (1 H, d, 72.9), 
3.22 (1 H, s), 2.24 (3 H, d,70.8); 4 (CDCI3) 162.7, 159.5, 159.4, 136.2, 136.1, 130.3, 130.2, 
129.7, 129.6, 129.5, 129.4, 129.1, 126.1, 126.0, 115.1, 114.8, 114.0, 113.9, 113.8, 89.3,84.5, 

30 81.8,75.3,74.0,73.5,71.7,69.2,55.4,55.3, 14.7 (d, 73.9); m/z (FAB) 535 [M+Na]^ 51 1 [M- 
H]*, 493 [M-H-H2O]*. 
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Example 38c: Synthesis of (2S,3S,4S)-4-Hydroxy-2-[(R)-hydroxy(l-naphtyl)methyl]-4-(p- 
inethoxyben2yloxy)-3-(p-n[iethoxybenzyloxyinethyl) tetrahydrofuran [Compound 12c of 
Scheme 2] 

5 1-Bromonaphthalene (1 .55 g, 7.5 mmol) was added to a stirred mixture of magnesium 

turnings (1 82 mg, 7.5 mmol) and iodine (1 0 mg) in THF (10 cm^). The mixture was stirred at 40 

for 1 h whereupon it was allowed to cool to room temperature. A solution of aldehyde 11 
(603 mg, 1 .5 mmol) in THF (10 cm ) was added slowly and the reaction was stirred for 12 h. The 
crude product was purified by column chromatography [4-5% (v/v) MeOH in CH2CI2] to give 

10 tetrahydrofuran 12c (756 mg, 95%) as a colorless oil; Rf035 (CH2Cl2/MeOH 95:5, v/v); 5n 
(CDCI3) 8.08 (1 H, m), 7.86(1 H,m), 7.79(1 H, d,y8.2), 7.72(1 H, d, 7 7.2), 7.49-7.44 (3H, 
m), 7,18 (2 H, d, J8.4), 6.84 (2 H, d, J8.6), 6.74 (2 H, d, J 8.7), 6.68 (2 H, d, 78.8), 5.52 (1 H, 
dd,y3.7 and 5.6), 4.45 (2 H,s), 4.34(1 H,dd,/2.5 and 5.9), 4.03 (1 H, d,y 11.0), 3.96 (1 H,d, 
y 11.0), 3.93 (1 H, d,y9.5), 3.80 (1 H, d,y9.3), 3.77 (3 H, s), 3.75 (1 H, d,y2.6), 3.72 (3 H, s), 

15 3.68(1 H,d,J9.3), 3.56(1 H, d, 73.7), 3.49 (1 H, d, J9.3), 3.34 (1 H, s); 4 (CDCI3) 159.5, 

159.3, 136.3, 134.0, 131.0, 129.7, 129.6, 129.5, 129.4, 129.0, 128.6, 128.2, 125.6, 125.5, 123.5, 
1 14.0, 1 13.8, 1 13.7, 88.7, 84.7, 81.9, 75.5, 73.5, 71.7, 71.3, 69.3, 55.4, 55.3; m/z (FAB) 553 
[M+Na]*, 529 [M-H]^, 51 1 [M-H-H2O]*. 


20 Example 38d: (2S,3S,4S)-4-Hydroxy-2-[(R)-hydroxy(l-pyrenyl)methyl]-4-(p-methoxy- 
benzyloxy)-3-(p-methoxybenzyloxymethyl)tetrahydrofuran [Compound 12d of Scheme 2] 

Tetrahydrofuran 12d was synthesized from aldehyde 11 (515 mg, 1.28 mmol), 1- 
bromopyrene (1.0 g, 3.56 mmol), magnesium turnings (155 mg, 6.4 mmol), iodine (10 mg ) and 
THF (20 cm'') following the procedure described for synthesis of compound 12c. The crude 

25 product was purified by column chromatography [3-4% (v/v) MeOH in CH2CI2] to give 

tetrahydrofuran 12d (690 mg, 89%) as a pale yellow solid; /?/0.35 (CH2Cl2/MeOH 95:5, v/v); Sh 
(CDCI3) 8.23 (2 H, d,y 8.4 and 9.2), 8.19-8.13 (3 H, m), 8.05-7.99 (4 H. m), 7.14 (2 H, d, J 8.8), 
6.82 (2 H, d, J9.0), 6.30 (2 H, d, 78.7), 6.20 (2 H, d, 7 8.6), 5.87 (1 H, d, 77.2), 4.43 (2 H, s), 
4.41 (1 H, m),4.01 (1 H, d, 79.4), 3.91 (1 H, d, 7 11.8), 3.86(1 H, d, 79.2), 3.77 (1 H, d,71.9), 

30 3.76 (3 H, s), 3.70-3.64 (3 H, m), 3.52-3.45 (1 H, m), 3.44 (3 H, s); 4 (CDCI3) 159.5, 158.9, 

133.9, 131.4, 131.1, 130.7, 129.7, 129.5, 129.2, 128.9, 128.5, 127.8, 127.7, 127.5, 126.0, 125.5, 
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125.3, 125.2, 125.1, 125.0, 124.9, 122.9, 113.9, 113.3, 89.5, 83.5, 82.0, 75.7, 73.4, 71.3, 71.0, 
69.3, 55.3, 55.0; m/z (MALDI) 627 [M+Na]^, 609 [M"'+Na-H20]''. 


Example 38e: (2S,3S,4S)-4-Hydroxy-2-[(R)-hydroxy(2,4,5-trimethyIphenyl)methyI]-4-(p. 
5 methoxybenzyloxy)-3-(p-methoxybenzyloxymethyl) tetrahydrofuran ICompound 12e of 
Scheme 2] 

Tetrahydrofuran 12e was synthesized from aldehyde 11 (515 mg, 1.28 nrniol), 1-bromo- 
2,4,5-trimethylbenzene (1.28 g, 6.4 mmol), magnesium turnings (155 mg, 6.4 mmol), iodine (10 
mg) and THF (20 cm ) following the procedure described for synthesis of compound 12c. The 

10 crude product was purified by column chromatography [3-4% (v/v) MeOH in CH2CI2] to give 
tetrahydrofuran 12e (589 mg, 88%) as a colorless oil; /?/0.34 (CH2Cl2/MeOH 95:5, v/v); 5h 
(CDCI3) 7.25 (2 H, d,y8.7), 7.21 (2 H, d, J8.9), 6.90 (1 H, s), 6.87 (1 H, s), 6.85 (2 H, d, J8.9), 
6.76 (2 H, d, J8.7), 4.95 (1 H, dd, 7 3.6 and 5.9), 4.48 (2 H, s), 4.18-4.08 (3 H, m), 3.89 (1 H, d, 
79.6), 3.80 (1 H, m), 3.79 (3 H, s), 3.77 (3 H, s), 3.71 (1 H, d,y9.2), 3.64 (1 H, d,72.6), 3.51 (1 

15 H, d, J9.4), 3.24 (1 H, s), 3.18 (1 H, d, J3.4), 2.25 (3 H,s), 2.22 (3 H,s), 2.21 (3 H, s); 4 

(CDCI3) 159.5, 159.3, 136.0, 135.8, 134.2, 132.5, 132.0, 129.8, 129.7, 129.6, 129.5, 128.5,113.9, 
1 13.8, 88.6, 84.7, 81.7, 75.4, 73.5, 71.7, 70.9, 69.4, 55.3, 19.5, 19.4, 19.0; m/z (FAB) 545 
[M+Na]^, 521 [M-H]^, 503 [M-H-HzO]^ 

20 Example 39: General procedure for the cyclization of 12a-e to give compounds 13a-e as 
shown in Scheme 2. 

iV,iV,Ar^-Tetramethylazodicarboxamide (TMAD) was added in one portion to a stirred 
solution of the compounds 12a-e as shown in Scheme 2 and tributylphosphine in benzene at 0 
°C. The mixture was stirred for 12 h at room temperature whereupon it was diluted with diethyl 
25 ether (50 cm ). The organic phase was washed successively with saturated aqueous NH4CI (2 x 
20 cm^) and brine (25 cm^), dried (Na2S04), filtered and evaporated to dryness under reduced 
pressure. The crude product obtained was purified by column chromatography [1 .5-2% (v/v) 
MeOH in CH2CI2] to give compounds 13a-e as shown in Scheme 2. 
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Example 39a: (lS,3S,4R,7S)-7-(p-Methoxyben2yloxy)-l-(p-methoxybenzyloxyinethyl)-3- 
phenyI-2,5-dioxabicyclo[2.2.11heptane [Compound 13a of Scheme 2] 

Cyclization of compound 12a (540 mg, 1.13 mmol) in the presence of TMAD (310 mg, 
1.8 mmol), PBu3 (364 mg, 1 .8 mmol) and benzene (10 cm^) followed by the general work-up 
5 procedure and colunm chromatography afforded compound 13a as a colorless oil (400 mg, 
77%); i?/0.51 (CHzCh/MeOH 98:2, v/v); Sh (CDCI3) 7.36-7.33 (7 H, m), 7.10 (2 H, d,y8.3), 
6.88 (2 H, d, J8.7), 6.78(2 H, d, 78.7), 5.17 (1 H, s, H-3), 4.59 (2 H, br s, -CH2(MPM)), 4.43 (1 

H, d,yil.3,-CH2(MPM)),4.34 (1 H, d, J 11.3, -CH2(MPM)), 4.19 (1 H, s, H-4), 4.09(1 H,d,7 
7.7, H-6), 4.06 (1 H, d, J7.7, H-6), 4.01 (1 H, s, H-7), 3.82-3.77 (5 H, m, -C1-CH2-O-, OCH3), 

10 3.76 (3 H, s, -OCH3); do (CDCI3) 159.4, 159.3, 139.4 (C-1'), 130.3, 129.7, 129.5, 129.3, 128.5, 
127.5, 125.4, 113.9, 113.8, 85.9 (C-1), 84.1 (C-3), 81.1 (C-4), 77.4 (C-7), 73.7 (-CH2(MPM)), 
73.4 (C-6), 71.8 (-CH2(MPM)), 66.3 (-C-CHj-O-), 55.4 (-OCH3), 55.3 (-OCH3); m/z (FAB) 467 
[M+Na-H20]^, 461 [M-H]*. 

15 Example 39b: (lS,3S,4R,7S)-3-(4-Fluoro-3-methylphenyl)-7-(p-methoxybenzyloxy)-l-(p- 
methoxybenzyloxymethyl)-2,5-dioxabicyclo[2.2.1]heptane [Compound 13b of Scheme 2] 
Cyclization of compound 12b (550 mg, 1.08 mmol) in the presence of TMAD (275 mg, 

I. 6 nrniol), PBu3 (325 mg, 1.6 nmiol) and benzene (10 cm^) followed by the general work-up 
procedure and column chromatography afforded compoimd 13b as a colorless oil (445 mg, 

20 84%); Rf 0.52 (CHzClj/MeOH 98:2, v/v); 5h (CDCI3) 7.28 (2 H, d, / 8.7 ), 7. 1 1 (2 H, d, J 8.6), 
7.08-7.09 (2 H, m, H-2' and H-6'), 6.94 (1 H, dd, J8.5 and 9.2, H-5'), 6.88 (2 H, d, 7 8.6), 6.79 
(2 H, d, J8.4), 5.08(1 H, s, H-3), 4.62-4.55 (2 H, m, -CH2(MPM)), 4.45 (1 H, d, 7 1 1.1, - 
CH2(MPM)), 4.36 (1 H, d, J 1 1.6, -CH2(MPM)), 4.13 (1 H, s, H-4), 4.07, 4.03 (1 H each, 2d, J 
7.6 each, H-6), 3.99 (1 H, s, H-7), 3.81 (2 H, m,-C,-CH2-0-), 3.80 (3 H, s,-0CH3), 3.77 (3 H, s,- 

25 OCH3), 2.23 (3 H, d, J 1.6, Ar-CHs); 4 (CDCI3) 162.3 (C-4'), 159.4, 159.3, 134.8, 134.7, 130.3, 
129.6, 129.5, 129.2, 128.5, 128.4, 128.3, 124.2, 115.1, 114.8, 113.9, 113.8, 85.9 (C-1), 83.5 (C- 
3), 81.0 (C-4), 77.1 (C-7), 73.6 (-CH2(MPM)), 73.4 (C-6), 71.8 (-CH2(MPM)), 66.2 (-C1-CH2-O- 
), 55.4 (-OCH3), 55.3 (-OCH3), 14.7 ( d, J3.3, Ar-CHa); m/z (FAB) 494 [M]*, 493 [M-H]*. 
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Example 39c: (lS,3S,4R,7S)-7-(p-Methoxybenzyloxy)-l-(p-inethoxybenzyloxymethyl)-3-(l- 
naphthyl)-2,5-dioxabicyclo[2.2.1]heptane (Compound 13c of Scheme 2] 

Cyclization of compound 12c (700 mg, 1 .32 mmol) in the presence of TMAD (345 mg, 
2.0 mmol), PBU3 (405 mg, 2.0 mmol) and benzene (15 cm^) followed by the general work-up 
5 procedure and column chromatography afforded compound 13c as a colorless oil (526 mg, 78%); 
RfO.53 (CHaCh/MeOH 98:2, v/v); Sh (CDCI3) 7.91-7.86 (2 H, m), 7.78 (1 H, d, 78.2), 7.73 (1 

H, d,y7.1), 7.53-7.46 (3 H, m), 7.32 (2 H, d, J8.7), 7.04 (2 H, d, J8.7), 6.90 (2 H, d,y8.3), 6.71 
(2 H, d, J8.6), 5.79 (1 H, s, H-3), 4.67-4.61 (2 H, m, -CH2(MPM)), 4.43 (1 H, s, H-4), 4.38 (1 H, 
d, J 1 1.2, -CH2(MPM)), 4.27 (1 H, d, J 10.9, -CH2(MPM)), 4.16 (2 H, br s, H-6), 4.08 (1 H, s, H- 

10 7), 3.91, 3.87 (1 H each, 2d, 7 1 1.0 each,-C,-CH2-0-), 3.81 (3 H, s,-0CH3), 3.72 (3 H, s,-0CH3); 
6i;(CDCl3) 159.3, 134.6 (C-r), 133.5, 130.3, 129.8, 129.7, 129.4, 129.3, 128.9, 128.1, 126.4, 
125.8, 125.6, 123.8, 122.7, 113.9, 113.7, 85.7 (C-1), 82.3 (C-3), 79.9 (C-4), 78.2 (C-7), 73.7 (- 
0CH2(MPM)), 73.5 (C-6), 71.8 (-0CH2(MPM)), 66.3 (-C-CHz-O-), 55.4 (-OCH3), 55.3 (- 
OCH3); m/z (FAB) 512 [Mf , 511 [M-Hf . 

15 

Example 39d: (lS,3S,4R,7S)-7-(p-Methoxybenzyloxy)-l-(p-methoxybenzyIoxymethyl)-3-(l- 
pyrenyl)-2,5-dioxabicyclo[2.2.1]heptane [Compound 13d of Scheme 2] 

Cyclization of compound 12d (650 mg, 1 .08 mmol) in the presence of TMAD (275 mg, 

I. 6 irmiol), PBu3 (325 mg, 1.6 mmol) and benzene (10 cm^) followed by the general work-up 
20 procedure and column chromatography afforded compound 13d as a pale yellow solid (496 mg, 

79%); Rf 0.53 (CHaCb/MeOH 98:2, v/v); Sh (CDCI3) 8.29 (1 H, d, 7 8.2), 8.18-8.12 (5 H, m), 
8.08-8.01 (2 H, m), 7.96 (1 H, d, J 7.5), 7.35 (2 H, d, J 8.5), 6.97 (2 H, d, J 8.9), 6.92 (2 H, d, J 
8.8), 6.60 (2 H, d, J8.8), 6.09 (1 H, s, H-3), 4.71-4.65 (2 H, m, -CH2(MPM)), 4.49 (1 H, s, H-4), 
4.34(1 H, d,J11.4, -CH2(MPM)), 4.23 (1 H, d,y 11.1, -CH2(MPM)), 4.25 (1 H, d,/7.6, H-6), 

25 4.21 (1 H, d, J7.8, H-6), 4.16 (1 H, s, H-7), 3.95-3.94 (2 H, m, -C1-CH2-O-), 3.81 (3 H, s,- 

OCH3), 3.59 (3 H, s,-0CH3); 4 (CDCI3) 159.4, 159.3, 132.2 (C-l'), 131.4, 130.8, 130.7, 130.4, 
129.5, 129.4, 128.0, 127.5, 127.4, 126.9, 126.1, 125.6, 125.4, 124.9, 124.8, 124.7, 123.6, 122.0, 
1 13.9, 113.7, 85.9 (C-l), 82.7 (C-3), 80.6 (C-4), 77.9 (C-7), 73.9 (-0CH2(MPM)), 73.5 (C-6), 
71.8 (-0CH2(MPM)), 66.3 (-C,-CH2-0-), 55.4 (-OCH3), 55.2 (-OCH3); m/z (FAB) 587 [M+Hf , 

30 586 [M]^ 
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Example 39e: (lS,3S54RJS)-7-(p-Methoxybenzyloxy)-l-(p-methoxybenzyloxyniethyl)-3- 
(254,5-trimethyIphenyl)-255-dioxa bicyclo[2.2.1]heptane [Compound 13e of Scheme 2] 

Cyclization of compound 12e (550 mg, 1.05 mmol) in the presence of TMAD (275 mg, 
1.6 mmol), PBu3 (325 mg, 1.6 mmol) and benzene (10 cm ) followed by the general work-up 
5 procedure and column chromatography afforded compound 13e as a colorless oil (425 mg, 80%); 
Rf 0.52 (CHsCb/MeOH 98:2, v/v); 5h (CDCI3) 7.30 (2 H, d, 79.0), 7.24 (1 H, s, H-6'), 7.13 (2 H, 
d, / 8.9), 6.89 (1 H, s, H-3'), 6.88 (2 H, d, J 8.8), 6.79 (2 H, d, J 8.6), 5. 1 8 (1 H, s, H-3), 4.64- 
4.57 (2 H, m, -CH2(MPM)), 4.46 (1 H, d, 7 1 1.2, -CH2(MPM)), 4.36 (1 H, d, / 1 1.5, - 
CH2(MPM)), 4.18 (1 H, s, H-4), 4.14 (1 H, s, H-7), 4.09 (1 H, d, J7.9, H-6), 4.04 (1 H, d, /7.7, 
10 H-6), 3.86 (2 H, s,-Ci-CH2-0-), 3.80 (3 H, s,-0CH3), 3.76 (3H, s,-0CH3), 2.21 (6 H, s, 2 x Ar- 
CH3), 2.17 (3 H, s, Ar-CH3); 4 (CDCI3) 159.4, 159.3, 135.5 (C-1'), 134.4, 134.0, 131.7, 131.3, 
130.5, 129.9, 129.4, 129.2, 127.2, 113.9, 113.8, 85.6 (C-1), 82.4 (C-3), 79.4 (C-4), 77.6 (C-7), 
73.5 (-0CH2(MPM)), 73.4 (C-6), 71.8 (-0CH2(MPM)), 66.3 (-C1-CH2-O-), 55.4 (-OCH3), 55.3 
(-OCH3), 19.5 (-CH3), 19.3 (-CH3), 18.4 (-CH3); m/z (FAB) 504 [M]"", 503 [M-H]^. 

15 

Example 40: General procedure for the oxidative removal of the p-methoxybenzyl groups 
to give Compounds 14a-e as shown in Scheme 2. 

To a stirred solution of Compound 13a-e in CH2CI2 (containing a small amount of H2O) 
at room temperature, was added 2,3-dichloro-5,6-dicyanoquinone (DDQ) which resulted in an 

20 immediate appearance of a deep greenish-black color which slowly faded into pale brownish- 
yellow. The reaction mixture was vigorously stirred at room temperature for 4 h. The precipitate 
was removed by filtration through a short pad of silica gel and washed with EtOAc. The 
combined filtrate was washed, successively, with saturated aqueous NaHC03 (2 x 25 cm^) and 
brine (25 cm^). The separated organic phase was dried (Na2S04), filtered and evaporated to 

25 dryness under reduced pressure. The crude product obtained was purified by column 
chromatography [4-5% (v/v) MeOH in CH2CI2] to give compounds 14a-e. 

Example 40a: (lS,3S,4R,7S)-7-Hydroxy-l-hydroxymethyl-3-phenyl-2,5-dioxabicycIo[2.2.1]- 
heptane [Compound 14a of Scheme 2] 

30 Compound 13a (400 mg, 0.86 mmol) was treated with DDQ (600 mg, 2.63 mmol) in a 

mixture of CH2CI2 (10 cm^) and H2O (0.5 cm^). After the general work-up procedure and column 
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chromatography, compound 14a was obtained as a white solid material (128 mg, 66%); /?/0.30 
(CH2Cl2/MeOH 9:1, v/v); 5h ((CD3)2C0/CD30D; (CD3)2C0 was added to the compound 
followed by addition of CD3OD until a clear solution appeared) 7.40-7.22 (5 H, m), 4.99 (1 H, s), 
4.09(1 H,s), 4.04(1 H, s), 4.01 (1 H, d, 77.7), 3.86 (1 H, d, J 7.7), 3.90 (2 H,brs),3.77 (2 H,br 
5 s); dc ((CD3)2CO/CD30D; (CD3)2CO was added to the compound followed by addition of 

CD3OD until a clear solution appeared) 140.0, 128.2, 127.2, 125.4, 87.2, 83.7, 83.5, 72.3, 70.2, 
58.4; m/z (FAB) 223 [M+H]". 

Example 40b: (lS,3S,4R,7S)-3-(4-Fluoro-3-methylphenyl)-7-hydroxy-l-hydroxymethyl- 
10 2,5-dioxabicycIo[2.2.1]heptane [Compound 14b of Scheme 2] 

Compound 13b (400 mg, 0.81 mmol) was treated with DDQ (570 mg, 2.5 mmol) in a 
mixture of CH2CI2 (10 cm^) and H2O (0.5 cm^). After the general work-up procedure and colunm 
chromatography, compound 14b was obtained as a white solid material (137 mg, 67%); i?/0.31 
(CH2Cl2/MeOH9:l,v/v);6H(CD30D) 7.23 (1 H, d,y8.1), 7.19(1 H,m), 6.99(1 H,dd,y8.5 
15 and 9.3), 4.99 (1 H, s), 4.09 (1 H, s), 4.06 (1 H, s), 4.03 (1 H, d, 7 7.6), 3.93-3.91 (3 H, m), 2.25 
(3 H, d,71.4); 4(CD30D) 161.9 (d, J 243.3), 136.4 (d,y3.4), 129.6 (d.JS.O), 126.1 (d, J 22.8), 
125.5 (d, 78.0), 1 15.7 (d, 7 22.9), 88.5, 85.0, 84.3, 73.5, 71.3, 59.4, 14.5 (d,y 3.7); m/z (FAB) 
255 [M+H]^. 

20 Example 40c: (lS^S,4R,7S)-7-Hydroxy-l-hydroxymethyl-3-(l-naphthyl)-2,5-dioxabicyclo- 
[2.2.1]heptane [Compound 14b of Scheme 2] 

Compound 13c (475 mg, 0.93 mmol) was treated with DDQ (600 mg, 2.63 mmol) in a 
mixture of CH2CI2 (10 cm^) and H2O (0.5 cm^). After the general work-up procedure and column 
chromatography, compound 14c was obtained as a white solid material (170 mg, 67%); /?/0.31 

25 (CH2Cl2/MeOH 9:1, v/v); Sh (CDCI3/CD3OD; CD3OD was added to the compound followed by 
addition of CDCI3 until a clear solution appeared) 7.94-7.86 (2 H, m), 7.80-7.74 (2 H, m), 7.55- 
7.46 (3 H, m), 5.74 (1 H, s), 4.56 (2 H, br s), 4.37 (1 H, s), 4.24 (1 H, s), 4.17-4.1 1 (2 H, m), 4.04 
(2 H, br s); 4 (CDCI3/CD3OD; CD3OD was added to the compound followed by addition of 
CDCI3 until a clear solution appeared 134.7, 134.0, 130.2, 129.3, 128.6, 126.8, 126.2, 125.8, 

30 123.8, 122.8, 87.4, 83.1, 82.2, 73.1, 71.5, 59.0; m/z (FAB) 273 [M+H]^ 272 [Mf . 
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Example 40d: (lS,3S,4R,7S)-7-Hydroxy-l-hydroxymethyI-3-(l-pyrenyl)-2,5-dioxabicyclo- 
[2.2.1]heptane [Compound 14d of Scheme 2] 

Compound 13d (41 1 mg, 0.7 mmol) was treated with DDQ (570 mg, 2.5 mmol) in a 
mixture of CH2CI2 (10 cm^) and H2O (0.5 cm^). After the general work-up procedure and column 
5 chromatography, compound 14d was obtained as a white solid material (182 mg, 75%); ii/0.32 
(CH2Cl2/MeOH 9:1, v/v); Sh (CDCI3/CD3OD; CD3OD was added to the compound followed by 
addition of CDCI3 until a clear solution appeared) 8.32 (1 H, d, J 7.8), 8.23-8.18 (5 H, m), 8.06 
(2 H, br s), 8.01 (1 H, d, 77.6), 6.06 (IH, s), 4.47 (1 H, s), 4.36 (1 H, s), 4.27-4.18 (2 H, m), 4.10 
(2 H,br s); 4 (CDCI3/CD3OD) 132.2, 131.0, 128.5, 127.8, 127.3, 126.5, 125.9, 125.7, 125.1, 
10 123.6, 122.1, 87.7, 83.7, 82.6, 73.1, 71.4, 58.9; m/z (FAB) 347 [M+H]^ 346 [Mf . 

Example 40e: (lS,3S,4R,7S)-7-Hydroxy-l-hydroxymethyl-3-(2,4,5-trimethyIphenyI)-2,5- 

dioxabicycIo[2.2.1]heptane [Compound 14e of Scheme 2] 

Compound 13e (355 mg, 0.7 mmol) was treated with DDQ (570 mg, 2.5 mmol) in a 
15 mixture of CH2CI2 (10 cm^) and H2O (0.5 cm^). After the general usual work-up procedure and 

column chromatography, compound 14e was obtained as a white solid material (120 mg, 65%); 

i?/0.31 (CH2Cl2/MeOH 9:1, v/v); 5h (CDCI3/CD3OD; CD3OD was added to the compound 

followed by addition of CDCI3 until a clear solution appeared) 7.23 (1 H, s), 6.92 (1 H, s), 5.14 

(1 H, s), 4.26 (1 H, s), 4.10 (1 H, s), 4.08, (1 H, d,y 7.7), 4.00-3.95 (3 H, m), 2.23 (6 H, s), 2.21 
20 (1 H, s); Sc (CDCI3/CD3OD; CD3OD was added to the compound followed by addition of CDCI3 

until a clear solution appeared) 135.6, 133.9, 133.8, 131.7, 131.2, 126.6, 86.6, 82.1,81.9, 72.3, 

70.6, 58.5, 19.2, 19.0, 18.1; m/z (FAB) 265 [M+H]^ 264 [Mf. 

Example 41: General procedure for dimethoxytritylation of compounds 14a-e to give 
25 Compounds 15a-e as shown in Scheme 2. 

4,4'-Dimethoxytrityl chloride (DMTCl) was added in one portion to a stirred solution of 
compound 14a-e in anhydrous pyridine. After stirring the mixture at room temperature for 4 h, 
methanol (0.2 cm"^) was added and the resulting mixture was evaporated to dryness under 
reduced pressure. The residue was coevaporated with anhydrous CH3CN (2x5 cm^) and 
30 anhydrous toluene (2x5 cm"^) and then dissolved in CH2CI2 (20 cm^, traces of acid removed by 
filtration through a short pad of basic alumina). The resulting solution was washed, successively. 
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with saturated aqueous NaHCOs (2x10 cm^) and brine (10 cm^). The separated organic phase 
was dried (Na2S04), filtered and evaporated to dryness under reduced pressure. The crude 
product obtained was purified by column chromatography [0.25- 0.50% (v/v) MeOH in CH2CI2, 
containing 0.5% Et^N] affording compounds 15a-e. 

5 

Example 41a: (lR,3S,4R,7S)-lT(4,4'-DimethoxytrityIoxymethyl)-7-hydroxy-3-phenyl-2,5- 
dioxabicyclo[2.2.1]heptane [Compound 15a of Scheme 2] 

Dimethoxytritylation of compound 14a (108 mg, 0.49 mmol) using DMTCl (214 mg, 
0.63 mmol) in anhydrous pyridine (2 cm^) followed by the general work-up procedure and 

10 column chromatography afforded compound 15a as a white solid material (180 mg, 71%); Rf 
0.31 (CH2Cl2/MeOH 98:2, v/v); 5h (CDCI3) 7.66-7.21 (14 H, m), 6.84 (4 H, d,y8.8), 5.19 (1 H, 
s), 4.29 (1 H, s), 4.13 (1 H, s), 4.07 (1 H, d, J8.4), 4.01 (1 H, d,y8.3), 3.78 (6 H, s), 3.55 (1 H, 
d, J 10.2), 3.50 (1 H, d, J 10.7), 2.73 (1 H, br s); 4 (CDCI3) 158.6, 149.8, 144.9, 139.4, 136.2, 
135.9, 135.8, 130.3, 130.2, 128.5, 128.3, 128.0, 127.6, 126.9, 125.4, 123.9, 113.3, 86.4, 86.0, 

15 83.8, 83.4, 73.0, 71 .6, 60.2, 55.3; m/z (FAB) 525 [M+Hf , 524 [M]*. 

Example 41b: (lR,3S,4R,7S)-l-(4,4'-DimethoxytrityIoxymethyl)-3-(4-nuoro-3- 
methylphenyl)-7-hydroxy-2,5-dioxabicycio[2.2.1]heptane [Compound 15b of Scheme 2] 

Dimethoxytritylation of compound 14b (95 mg, 0.38 mmol) using DMTCl (129 mg, 0.42 

20 nrniol) in anhydrous pyridine (2 cm^ ) followed by the general work-up procedure and coliunn 
chromatography afforded compound 15b as a white solid material (126 mg, 61%); Rf 0.32 
(CHzCls/MeOH 98:2, v/v); 5h (CDCI3) 7.53-7.15 (1 1 H, m), 6.97 (1 H, dd, J8.7 and 8.9), 6.84 (4 
H, d, J 8.8), 5.11 (1 H, s), 4.26(1 H, d, 73.9), 4.08 (1 H, s), 4.03(1 H, d, 7 8.0), 3.95 (1 H, d,J 
8.0), 3.78 (6 H, s), 3.54 (1 H, d, J 10.5), 3.47 (1 H, d, J 10.1), 2.26 (3 H, d, J 1.5), 2.08 (1 H, br 

25 s); 4 (CDCI3) 160.8 (d, J244.1), 158.7, 144.9, 135.9, 134.7, 134.6, 130.3, 130.2, 130.1, 128.5, 
128.4, 128.3, 128.0, 127.0, 125.2, 124.9, 124.4, 124.3, 115.2, 114.9, 113.4, 86.5, 86.0, 83.7, 83.0, 
72.9, 71.7, 60.1, 55.3, 14.8 (d, J3.1); m/z (FAB) 556 [Mf . 
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Example 41c: lR^S,4R,7S)-l-(4,4*-Dimethoxytrityloxymethyl)-7-hydroxy-3-(l-naphthyl)- 
2,5-dioxabicyclo[2.2.1]heptane [Compound 15c of Scheme 2] 

Dimethoxytritylation of compound 14c (125 mg, 0.46 mmol) using DMTCl (170 mg, 0.5 
mmol) in anhydrous pyridine (2 cm^) followed by the general work-up procedure and column 
5 chromatography afforded compound 15c as a white solid material (158 mg, 60%); RfO.35 

(CHzCb/MeOH 98:2, v/v); 5h (CDCI3) 7.95-7.86 (3 H, m), 7.79 (1 H, d,y8.3), 7.58-7.41 (9 H, 
m), 7.35-7.23 (3 H, m), 6.86 (4 H, d.J 8.8), 5.80 (1 H, s), 4.36 (1 H, s), 4.32 (1 H, d,76.5), 4.17 
(1 H, d,78.3), 4.06 (1 H, d, J8.0), 3.78 (6 H, s), 3.62-3.56 (2 H, m), 2.00 (1 H, d, J6.6); 4 
(CDCI3) 158.7, 144.9, 136.0, 135.9, 134.5, 133.6, 130.3, 129.8, 129.0, 128.3, 128.2, 128.1, 
10 127.0, 126.5, 125.9, 125.6, 123.9, 122.6, 1 13.4, 86.6, 85.7, 82.5, 81.7, 73.1, 72.6, 60.2, 55.3; /n/z 
(FAB) 575 [M+H]^ 574 [M]*. 

Example 41d: (lR,3S,4R,7S)-l-(4,4'-DimethoxytrityloxymethyI)-7-hydroxy-3-(l-pyrenyl)- 
2,5-dioxabicyclo[2.2.1]heptane [Compound 15d of Scheme 2] 

1 5 Dimethoxytritylation of the compound 14d (1 30 mg, 0.38 mmol) using DMTCl (140 mg, 

0.42 mmol) in anhydrous pyridine (2 cm^) followed by the general work-up procedure and 
column chromatography afforded compound 15d as a white solid material ( 147 mg, 61%); Rf 
0.37 (CH2Cl2/MeOH 98:2, v/v); 5h (CDCI3) 8.46 (1 H, d, J8.0), 8.19-8.00 (7 H, m), 7.61 (2 H, 
dd, J 1.6 and 7.4), 7.48 (4 H, d, J8.3), 7.35 (2 H, dd, 77.2 and 7.5), 7.25 (1 H, m), 7.15 (1 H, m), 

20 6.88 (4 H,d, 79.0), 6.10(1 H,s), 4.46(1 H,s), 4.43(1 H, br s), 4.25 (1 H, d, 78.1), 4.12 (1 H,d, 
78.1), 3.79 (6H, s), 3.71-3.63 (2 H, m), 2.22 (1 H, br s); 4 (CDCI3) 158.7, 149.8, 144.9, 136.1, 
136.0, 135.9, 132.1, 131.4, 130.9, 130.6, 130.3, 130.2, 129.2, 129.1, 128.4, 128.3, 128.2, 128.1, 
127.5, 127.4, 127.0, 126.9, 126.2, 125.5, 125.4,124.9, 124.8, 124.7,123.8, 123.7, 121.9, 113.4, 
86.6, 86.1, 83.2, 82.2, 73.2, 72.4, 60.3, 55.3; m/z (FAB) 649 [M+Hf , 648 [M]\ 

25 

Example 41e: (lR^S,4R,7S)-l-(4,4'-Dlmethoxytrityloxymethyl)-7-hydroxy-3-(2,4,5- 
trimethylphenyl)-2,5-dloxabicyclo[2.2.1]heptane [Compound 15e of Scheme 2] 

Dimethoxytritylation of compound 14e (80 mg, 0.3 mmol) using DMTCl (113 mg, 0.33 

mmol) in anhydrous pyridine (2 cm^) followed by the general work-up procedure and coluirm 
30 chromatography afforded compound 15e as a white solid material (134 mg, 78%); i?/0.32 

(CHjCh/MeOH 98:2, v/v); 6» (CDCI3) 7.55 (2 H, d, 7 7.9), 7.45-7.42 (4 H, m), 7.32-7.21 (4 H, 
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m), 6.93 (1 H, s), 6.84 (4 H, d, J 8.2), 5.20 (1 H, s), 4.40 (1 H, s), 4.08 (1 H, s), 4.04 (1 H, d, J 
8.3), 3.95 (1 H, d, J 8.2), 3.78 (6 H, s), 3.56 (1 H, d, J 10.5), 3.47 (1 H, d, J 10.2), 2.24 (3 H, s), 
2.22 (3 H, s), 2.19 (3 H, s); 4 (CDCI3) 158.6, 145.0, 136.0, 135.7, 134.4, 134.2, 131.8, 
131.3,130.3, 130.2, 128.3, 128.0, 127.2, 126.9, 1 13.3, 86.4, 85.7, 82.1, 81.8, 73.0, 71.8, 60.2, 
5 55.3, 19.6, 19.3, 18.4; m/z (FAB) 567 [M+Hf , 566 [Mf . 

Example 42: General procedure for synthesis of the phosphoramidite derivatives 16a-e as 
shown in Scheme 2. 

2-Cyanoethyl A^,A^'-diisopropylphosphoramidochloridite was added dropwise to a stirred 
10 solution of nucleoside 15a-e and A^y/V-diisopropylethylamine (DIPEA) in anhydrous CH2CI2 at 
room temperature. After stirring the mixture at room temperature for 6 h, methanol (0.2 cm^) was 
added and the resulting mixture diluted with EtOAc (20 cm^, containing 0.5% EtaN, v/v). The 
organic phase was washed, successively, with saturated a. NaHCOs (2 x 10 cm ) and brine (10 
cm^). The separated organic phase was dried (Na2S04), filtered and evaporated to dryness under 
1 5 reduced pressure. The residue obtained was purified by column chromatography [25-30% (v/v) 
EtOAc in «-hexane containing 0.5% Et3N] to give the amidites 16a-e. 

Example 42a: Synthesis of (lR,3S,4R,7S)-7-[2-Cyanoethoxy(diisopropylamino) 
phosphinoxy]-l-(4,4'-dimethoxytrityIoxymethyi)-3-phenyI-2,5-dioxabicycIo[2.2.1] heptane 
20 [Compound 16a of Scheme 2] 

Treatment of compound 15a (170 mg, 0.32 mmol) with 2-cyanoethyl A^,A^- 
diisopropylphosphoramidochloridite (85 mg, 0.36 mmol) in the presence of DIPEA (0.4cm ) and 
anhydrous CH2CI2 (2.0 cm^) followed by the general work-up procedure and column 
chromatography afforded phosphoramidite 16a as a white solid material (155 mg, 66%); i?/0.45, 
25 0.41 (CHaCla/MeOH 98:2, v/v); 5p (CDCI3) 149.3, 148.9. 

Example 42b: (lR,3S,4R,7S)-7-[2-Cyanoethoxy(diisopropylamino) phosphinoxyl-l-(4,4'- 
dimethoxytrityloxymethyl)-3-(4-fluoro-3-methylphenyl)-2,S-dioxabicyclo[2.2.1]heptane 
[Compound 16b of Scheme 2] 

30 Treatment of compound 15b (95 mg, 0.17 mmol) with 2-cyanoethyl NJ^- 

diisopropylphosphoramidochloridite (53 mg, 0.22 mmol) in the presence of DIPEA (0.3cm ) and 
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anhydrous CH2CI2 (2.0 cm ) followed by the general work-up procedure and column 
chromatography afforded phosphoramidite 16b as a white solid material (85 mg, 66%); R/OAS, 
0.41 (CHaCb/MeOH 98:2, v/v); 5p (CDCI3) 149.3, 148.8. 

5 Example 42c: Synthesis of (lR,3S,4R,7S)-7-[2- 

Cyanoethoxy(diisopropylamino)phosphinoxy]-l-(4,4'-dimethoxytrity]oxymethyl)-3-(l- 
naphthyl)-2,5-dioxabicycIo[2.2.1]heptane [Compound 16c of Scheme 2] 

Treatment of compound 5c (158 mg, 0.28 mmol) with 2-cyanoethyl NJsT- 
diisopropylphosphoramidochloridite (75.7 mg, 0.32 mmol) in the presence of DIPEA (0.4cm"^) 
10 and anhydrous CH2CI2 (2.0 cm^) followed by the general work-up procedure and column 

chromatography afforded phosphoramidite 16c as a white solid material (127 mg, 60%); /?/0.47, 
0.44 (CHjCh/MeOH 98:2, v/v); Sp (CDCI3) 149.2, 149.1. 

Example 42d: Synthesis of (lR,3S,4R,7S)-7-[2-Cyanoethoxy(diisopropylamino) 
15 phosphinoxy]-l-(4,4'-dimethoxytrityloxymethyI)-3-(l-pyrenyl)-2,5-dioxabicycIo[ 

2.2.1]heptane [Compound 16d of Scheme 2] 

Treatment of compound 15d (140 mg, 0.22 mmol) with 2-cyanoethyl NJ^- 

diisopropylphosphoramidochloridite (64 mg, 0.27 mmol) in the presence of DIPEA (0.3cm^) and 

anhydrous CH2CI2 (2.0 cm"^) followed by the general work-up procedure and column 
20 chromatography afforded phosphoramidite 16d as a white solid material (124 mg, 68%); /?/0.51, 

0.47 (CH2Cl2/MeOH 98:2, v/v); 5p (CDCI3) 149.4, 149.1. 

Example 42e: Synthesis of (lR53S,4R,7S)-7-[2-Cyanoethoxy(diisopropylamino) 
phosphinoxy]-l-(4,4'-dimethoxytrityloxymethyI)-3-(2,4,5-trimethyIphenyl)-2,5- 
25 dioxabicyclo[2.2.1]heptane [Compound 16e of Scheme 2] 

Treatment of compound 15e (130 mg, 0.23 mmol) with 2-cyanoethyl NJ^- 
diisopropylphosphoramidochloridite (64 mg, 0.27 mmol) in the presence of DIPEA (0.3cm^) and 
anhydrous CH2CI2 (2.0 cm^) followed by the general work-up procedure and column 
chromatography afforded phosphoramidite 16e as a white solid material (1 1 1 mg, 63%); 7J/0.44, 
30 0.42 (CH2Cl2/MeOH 98:2, v/v); S? (CDCI3) 149.0. 
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Example 43: Synthesis, deprotection and purification of oligonucleotides 

All oligomers were prepared using the phosphoramidite approach on a Biosearch 8750 
DNA synthesizer in 0.2 |imol scale on CPG solid supports (BioGenex). The stepwise coupling 
efficiencies for phosphoramidites 16a-c (10 min coupling time) and phosphoramidites 16d and 
5 16e (20 min coupling time) were >96% and for unmodified deoxynucleoside and ribonucleoside 
phosphoramidites (with standard coupling time) generally >99%, in all cases using 1/f-tetrazole 
as activator. After standard deprotection and cleavage from the solid support using 32% aqueous 
ammonia (12 h, 55 °C), the oligomers were purified by precipitation from ethanol. The 
composition of the oligomers were verified by MALDI-MS analysis and the purity (>80%) by 
10 capillary gel electrophoresis. Selected MALDI-MS data ([M-H]'; found/calcd.: ON3 2731/2733; 
ON4 2857/2857; ON6 3094/3093). 

Example 44. Thermal denaturation studies 

The thermal denaturation experiments were performed on a Perkin-Elmer UV/VIS 
15 spectrometer fitted with a PTP-6 Peltier temperature-programming element using a medium salt 
buffer solution (10 mM sodium phosphate, 100 mM sodium chloride, 0.1 mM EDTA, pH 7.0). 
Concentrations of 1.5 mM of the two complementary strands were used assuming identical 
extinction coefficients for modified and unmodified oligonucleotides. The absorbance was 
monitored at 260 nm while raising the temperature at a rate of 1 °C per min. The melting 
20 temperatures (Tm values) of the duplexes were determined as the maximum of the first 
derivatives of the melting curves obtained. 

Example 45: Synthesis of compounds 16a-16e and oligomers containing monomers 17a-17e 

LNA containing the derivatives 17a-17e (Figure 1, Schemel, Scheme 2), were 
25 synthesized, all based on the LNA-type 2'-0,4'-C-methylene-j8-D-ribofuranosyl moiety which is 

known to adopt a locked CV-endo RNA-like furanose conformation [S. Obika, D. Nanbu, Y. 

Hari, K. Morio, Y. In, T. Ishida, and T. Imanishi, Tetrahedron Lett, 1997, 38, 8735; S. K. Singh, 

P. Nielsen, A. A. Koshkin and J. Wengel, Chem. Commun., 1998, 455; A. A. Koshkin, S. K. 

Singh, P. Nielsen, V. K. Rajwanshi, R. Kumar, M. Meldgaard, C. E. Olsen and J. Wengel, 
30 Tetrahedron, 1998, 54, 3607; S. Obika, D. Nanbu, Y. Hari, J. Andoh, K. Morio, T. Doi and T. 

Imanishi, Tetrahedron Lett,, 1998, 39, 5401]. The syntheses of the phosphoramidite building 
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blocks 16a-16e suitable for incorporation of the LNA-type aryl C-glycosides 17a-17e are shown 
in Scheme 1 and Scheme 2 and described in details in the experimental section. In the design of 
an appropriate synthetic route, it was decided to utilize a reaction similar to one described 
recently in the literature. Thus, stereoselective attack of Grignard reagents of various 
5 heterocycles on a carbonyl group of an aldehyde corresponding to aldehyde 11 (Scheme 2) but 
with two 0-benzyl groups instead of the two /7-methoxybenzyl groups of aldehyde 11 (Scheme 
2) has been reported to furnish locked-C-nucleosides [S. Obika, Y. Hari, K. Morio and T. 
hnanishi. Tetrahedron Lett,, 2000, 41, 215; S. Obika, Y. Hari, K. Morio and T. hnanishi, 
. Tetrahedron Lett., 2000, 41, 221]. The key intermediate in the synthetic route selected herein, 

10 namely the novel aldehyde 1.1 was synthesized from the known fiiranoside 1 [R. Yamaguchi, T. 
Imanishi, S. Kohgo, H. Hone and H. Ohrui, Biosci. Biotechnol Biochem., 1999, 63, 736] 
following two different routes, hi general, 0-(p-Methoxy)benzyl protection was desirable instead 
of O-benzyl protection as removal of the benzyl protection at a later stage (i.e. 13 — >14) could 
also likely result in the cleavage of the benzylic 0-Ci bond present, e.g., in compounds 13 and 

15 14 (Scheme 2). In one route to give aldehyde 11, regioselective /7-methoxybenzylation of the 
fiiranoside 1, followed by mesylation and methanolysis yielded the anomeric mixture of the 
methyl furanosides 9. Base induced cyclization followed by acetyl hydrolysis afforded the 
aldehyde 11 in approximately 24% overall yield from 1 (Scheme 1 and Scheme 2). This yield 
was improved to following a different strategy. Thus, di-O-mesylation of 1 followed by 

20 methanolysis and base induced intramolecular nucleophilic attack from the 2-OH group afforded 
the cyclized anomeric mixture of methyl fiiranoside 4. Substitution of the remaining mesyloxy 
group of 4 with an acetate group, followed by deacetylation, /7-methoxybenzylation and then 
acetyl hydrolysis afforded the required aldehyde 11 (Scheme 1). 

Coupling of the aldehyde 11 with different aryl Grignard reagents yielded selectively one 

25 epimer of each of the compounds 12a-e in good yields (see experimental section for further 

details on this and other synthetic steps). Each of the diols 12a-e was cyclized under Mitsunobu 
conditions (TMAD, PBua) to afford the bicyclic j3-C-nucleoside derivatives 13a-e. Oxidative 
removal of the j9-methoxybenzyl protections was achieved in satisfactory yields using DDQ. 
Subsequent, selective 4,4'-dimethoxytritylation ( to give compounds 15a-e) followed by 

30 phosphorylation afforded the phosphoramidite building blocks 16a-e in satisfactory yields. The 
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configuration of compounds 13, and thus also conipounds 11, 12 and 14-17 were assigned based 
on IH NMR spectroscopy, including NOE experiments. 

All oligomers were prepared in the 0.2 jimiol scale using the phosphoramidite approach. 
The stepwise coupling efficiencies for phosphoramidites 16a-c (10 min coupling time) and 
5 phosphoramidites 16d and 16e (20 min coupling time) were >96% and for unmodified 

deoxynucleoside and ribonucleoside phosphoramidites (with standard coupling time) generally 
>99%, in all cases using l//-tetrazole as activator. After standard deprotection and cleavage from 
the solid support using 32% aqueous ammonia (12 h, 55 ^C), the oligomers were purified by 
precipitation from ethanol. The composition of the oligomers were verified by MALDI-MS 
10 analysis and the purity (>80%) by capillary gel electrophoresis. 

Example 46 Thermal denaturation studies to evaluate hybridization properties 

The hybridization of the oligonucleotides ONl-ONll (Table 8 below) toward four 9-mer 
DNA targets with the central base being each of four natural bases were studied by thermal 

15 denaturation experiments (T^ measurements; see the experimental section for details). 

Compared to the DNA reference ONI, introduction of one abasic LNA monomer Ab^ (ON2) 
has earlier been reported to prevent the formation of a stable duplex above 0 (only evaluated 
with adenine as the opposite base) [L. Kvaemo and J. Wengel, Chem, Commun., 1999, 657]. 
With the phenyl monomer 17a (ON3), Tm values in the range of 5-12 ®C was observed. Thus, the 

20 phenyl moiety stabilizes the duplexes compared to Ab^, but universal hybridization is not 
achieved as a preference for a central adenine base in the complementary target strand is 
indicated (Table 8). In addition, significant destabilization compared to the ONlrDNA reference 
duplex was observed. Results similar to those obtained for ON3 were obtained for oligomers 
isosequential with ON3 but containing 17b, 17c or 17e instead of 17a as the central monomer 

25 (Table 8, ON7, ON8 and ON9, respectively). 
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Table 8. Thermal denaturation experiments (Tm values shown) for ONl-ONl 1 towards DNA 
complements with each of the four natural bases in the central position'^ 



DNA target: 3 '-d(CACTYTACG) Y: 

A 

C 

G 

T 

5 

ONI 

5'-d(GTGATATGC) 

28 

11 

12 

19 


ON2 

5'-d(GTGAAb^ATGC) 

<3 

n.d. 

n.d. 

n.d. 


ON3 

5'-d(GTGA17aATGC) 

12 

5 

6 

7 


ON4 

5'-d(GTGA17dATGC) 

18 

17 

18 

19 


ON5 

5 '-d[2 '-OMe(GTGATATGC)] 

35 

14 

19 

21 

10 

ON6 

5'-d[2'-OMe(GT'^GA17dAT'^GC)3 

39 

38 

37 

40 


0N7 

5'-d(GTGA17bATGC) 

15 

7 

6 

8 


ON8 

5'-d(GTGA17cATGC) 

15 

7 

6 

9 


ON9 

5'-d(GTGA17eATGC) 

13 

6 

6 

7 


ONIO 5'-d[2'-OMe(GT^GA17bAT^GC)] 

31 

25 

26 

27 

15 

ONll 5'-d[2'-OMe(GT*'GA17cAT^GC)] 

34 

27 

27 

32 


^ Melting temperatures (Tm values/^C) measured as the maximum of the first derivative of the 
melting curve (A260 v5 temperature) recorded in medium salt buffer (10 mM sodium phosphate, 
100 mM sodium chloride, 0.1 mM EDTA, pH 7.0) using 1.5 |LiM concentrations of the two 
strands; A = adenine monomer, C = cytosine monomer, G = guanine monomer, T = thymine 

20 monomer; See Figure 1 and/or Scheme 2 for structures of T^, Ab^ and 17a-17e; DNA sequences 
are shown as d(sequence) and 2'-0Me-RNA sequences as 2'-0Me(sequence); "n.d." denotes 
"not determined". The data reported for ONI have been reported earlier [A. A. Koshkin, S. K. 
Singh, P. Nielsen, V. K. Rajwanshi, R. Kumar, M. Meldgaard, C. E. Olsen and J. Wengel, 
Tetrahedron, 1998, 54, 3607]. The data reported for ON2 has been reported earlier [L. Kvaem0 

25 and J. Wengel, Chem, Commun., 1999, 657]. 

The pyrene LNA nucleotide 17d (in ON4) displays more encouraging properties (Table 
8). Firstly, the binding affinity towards all four complements is increased compared to ON3 
(containing 17a). Secondly, universal hybridization is observed as shown by the four Tm values 
30 all being within 17-19 °C. With respect to universal hybridization, 17d thus parallels the pyrene 
DNA derivative Py [T. J. Matray and E. T. Kool, 7. Am, Chem. Soc, 1998, 120, 6191], but the 
decrease in thermal stability compared to the ONl:DNA reference is more pronounced for 17d 
(---lO °C) than reported for Py (~5 in a 12-mer polypyrimidine DNA sequence) [T. J. Matray 
and E. T. Kool, 1 Am. Chem. Soc, 1998, 120, 6191]. It therefore appears that stacking (or 
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intercalation) by theO pyrene moiety is not favored by the conformational restriction of the 
fiiranose ring of 17d, although comparison of the thermal stabilities of ON2, ON3 and ON4 
strongly indicate interaction of the pyrene moiety within the helix. 

When measured against an RNA target [3'-r(CACUAUACG)], the Tm values (using 
5 identical experimental conditions as for the experiments descried above) of ON3 was 1 1 .9 
and of ON4 was 12.7 °C. For oligomers ON7, ON8 and ON9 (Table 8), the corresponding Tm 
values were 1 1.7, 8.8 and 10.2 ^C, respectively. 

Example 47: The effect of pyrene LNA monomers in an RNA-like strand. 

10 ON5, ON6, ONIO and ONll (see Table 8 above), were synthesized. The former being 

composed entirely of 2'-OMe-RNA monomers and the latter three of six 2'-OMe-RNA 
monomers (see Figure 1), two LNA thymine monomers T^ (see Figure 1), and one central LNA 
pyrene monomer 17d (oligomer ON6), or one central monomer 17b (ONIO) or 17c (ONll). A 
sequence corresponding to ON6 but with three monomers has earlier been shown to form a 

15 duplex with complementary DNA of very high thermal stability. ON6 is therefore suitable for 
evaluation of the effect of introducing high-affinity monomers around a universal base. As seen 
in Table 8, the 2'-0Me-RNA reference ON5 binds to the DNA complement with slightly 
increased thermal stability and conserved Watson-Crick discrimination (compared to the DNA 
reference ONI). Indeed, the LNA/2'-0Me-RNA chimera ON6 displays universal hybridization 

20 behavior as revealed from the four Tm values (37, 38, 39 and 40 ®C). All four Tm values obtained 
for ON6 are higher than the Tm values obtained for the two fully complementary reference 
' duplexes ONlrDNA (Tm = 28 °C) and ON5:DNA (Tm = 35 ^C). 

These novel data demonstrate that the pyrene LNA monomer 17d display universal 
hybridization behavior both in a DNA context (ON4) and in an RNA-like context (ON6), and 

25 that the problem of decreased affinity of universal hybridization probes can be solved by the 
introduction of high-affinity monomers, e.g. 2'-0Me-RNA and/or LNA monomers. Increased 
affinities compared to ON7 and ON8 were obtained for ONIO and ONll, respectively, but 
universal hybridization behavior was not obtained as a preference for a central adenine base in 
the complementary target strand is indicated (Table 8 above). 

30 


208 


Example 48: Base-pairing selectivity in hybridization probes. 

A systematic thermal denaturation study with ON6 (Table 1 1) was performed to 
determine base-pairing selectivity. For each of the four DNA complements (DNA target strands; 
monomer Y = A, C, G or T) used in the study shown in Table 8 above, ON6, containing a central 

5 pyrene LNA monomer 17d, was hybridized with all four base combinations in the neighboring 
position towards the 3 '-end of ON6 (DNA target strands; monomer Z = A, C, G or T, monomer 
X = T) and the same towards the 5 '-end of ON6 (DNA target strands; monomer X = A, C, G or 
T, monomer Z = T). In all eight subsets of four data points, satisfactory to excellent Watson- 
Crick discrimination was observed between the match and the three mismatches (Table 1 1 

10 below, ATm values in the range of 5-25 °C). 

Table 11. Thermal denaturation experiments (Tm values shown) to evaluate the base-pairing 
selectivity of the bases neighboring the universal pyrene LNA monomer 17d in the 2'-0Me- 
RNA/LNA chimera ON6. In the target strand [3'-d(CAC-XYZ-ACG)], the central three bases 
1 5 XYZ are varied among each of the four natural bases'' 


5'-[2'-OMe(GT^G-A17dA-T*^GC)] 
3'-d(CAC -XYZ-ACG) 


XYZ 


XYZ 

TmTC 

XYZ 

T„/°C 

XYZ 

TW 

TAA 

26 

TCA 

22 

TGA 

22 

TTA 

29 

TAC 

26 

TCC 

29 

TGC 

26 

TTG 

31 

TAG 

24 

TCG 

24 

TGG 

30 

TTC 

32 

TAT 

39 

TCT 

38 

TGT 

37 

TTT 

40 

AAT 

18 

ACT 

27 

AGT 

22 

ATT 

28 

CAT 

30 

CCT 

31 

CGT 

27 

CTT 

35 

GAT 

14 

GCT 

28 

GGT 

16 

GTT 

27 

TAT 

39 

TCT 

38 

TGT 

37 

TTT 

40 


30 ^ See caption below Table 8 for abbreviations and conditions used; The data for matched 
neighboring bases (X = Z = T) are shown in bold. 
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The results reported herein have several important implications for the design of probes 
for universal hybridization: (1) Universal hybridization is possible with a conformationally 
restricted monomer as demonstrated for the pyrene LNA monomer; (2) Universal hybridization 
behavior is feasible in an RNA context; (3) The binding affinity of probes for universal 
5 hybridization can be increased by the introduction of high-affinity monomers without 

compromising the universal hybridization and the base-pairing selectivity of bases neighboring 
the universal base. 

Based on the results reported herein, that chimeric oligonucleotides comprising pyrene 
and other known universal bases attached at various backbones (eg. LNA-type monomers, 
10 ribofuranose monomers or deoxyribose monomers in 2'-0Me-RNA/LNA chimeric oligos) 

likewise will display attractive properties with respect to universal hybridization behavior. For 
example, an oligomer identical with the 2'-0Me-RNA/LNA oligo ON6 but with the 17d 
monomer substituted by a pyrenyl-2'-OMe-ribonucleotide monomer. 

15 Example 49: Chimeric oligonucleotides 

These chimeric oligonucleotides are comprised of pyrene and other known universal 
bases attached at various backbones (e.g. LNA-type monomers, ribofiiranose monomers or 
deoxyribose monomers in 2'-0Me-RNA/LNA oligos). Experimentation with these chimeric 
oligonucleotides are for evaluating the possibility of obtaining similar results to the 2'-0Me- 
20 RNA/LNA oligo ON6 at a lower cost, for example, by substituting Py*" with a pyrenyl-2'-0Me- 
ribonucleotide monomer. 

Example 50: The use of LNA Oligonucleotide Microarrays Provides Superior Sensitivity 
and Specificity in Expression Profiling 

25 A. In vitro synthesis of the yeast spike RNAs 

Amplification of the yeast genes was carried out by standard two-step PCR using yeast 

genomic DNA as template (21). In the second PCR, a poly-T2o tail was inserted in the amplicon. 

The DNA fragments were ligated into the pTRIamplS vector (Ambion, USA) using the Quick 

Ligation Kit (New England Biolabs, USA) according to the manufacturers instructions and 
30 transformed into E. coli DH-5a (21). Synthesis of in vitro RNA was done using the 

MEGAscript™ T7 Kit (Ambion, USA) according to the manufacturers instructions. 
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B. Design of the LNA expression arrays 

Capture probes were designed using the OligoDesign^'^ software as described in the 
previous examples. 

C. Printing and hybridisation of the LNA expression microarrays 

5 The LNA oligonucleotide microarrays were printed onto Immobilizer™ MicroArray 

Slides (Exiqon, Denmark) using the Packard Biochip I Arrayer (Packard, USA), with a spot 
volume of 2 X 300 pi of a 10 |LiM capture probe solution. Four replicas of each capture probe 
were printed on each slide. Mixed staged Caenorhabditis elegans worm cultures were cultivated 
according to standard protocols. RNA was extracted fi-om worm samples using the FastRNA Kit, 

10 GREEN (Q-BIO, USA) according to the supplier's instructions. Fluorochrome-labelled first 
strand cDNA was synthesized from worm total RNA or in vitro synthesized RNA as described 
(22) followed by purification of the cDNA target, hybridisation of the microarrays ovemight at 
65°C, washing of the slides and drying of the arrays (22). The slides were scanned using a 
ScanArray 4000 XL scanner (Perkin-Elmer, USA), and the array data were processed using the 

1 5 GenePix™ Pro 4.0 software package (Axon, USA). 

D. Assessment of sensitivity and specificity in LNA expression microarrays 

To enable direct comparisons between LNA and DNA capture probes in measuring gene 
expression levels, specific oligonucleotide capture probes for the Saccharomyces cerevisiae 

20 genes SWI5 and THI4 were designed in the 3 '-end of the two ORFs. The capture probes were 
synthesized as 50-mer DNA and corresponding LNA-modified oligonucleotides, respectively, 
with an LNA substitution at every 3rd nucleotide position. In addition, 40-mer DNA and LNA 
oligonucleotides were designed as truncated versions of the 50-mer capture probes, along with 
oligonucleotides with 1 to 5 consecutive mismatches positioned centrally in the 50-mer and 40- 

25 mer capture probes. All capture probes were synthesized with an anthraquinone group at the 5'- 
end and a hexaethyleneglycol dimer linker region (HEG2 spacer arm) enabling photocoupling 
onto polymer microarray slides as described in US patent No. 6,53 1 ,591 . 

To assess the sensitivity and specificity of the oligonucleotide microarrays, in vitro 
synthesized yeast RNA for either SWI5 or THI4 was spiked into C. elegans total RNA for cDNA 

30 target synthesis followed by hybridization of the microarrays with fluorochrome-labelled cDNA 
target pools. The incorporation of LNA nucleotides into 50-mer DNA oligonucleotide capture 
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probes results in a 3 to 4-fold increase in fluorescence intensity levels, when hybridized with the 
spiked, complex cDNA target pools under standard stringency conditions (Figures 36a and 36c). 
The sensitivity increase is even more pronounced, 5 to 12-fold, when 40-mer LNA capture 
probes are employed. None of the yeast capture probes showed cross-hybridization to C elegans 
5 cDNA target control without yeast spike RNA under the same conditions. 

The specificity of the oligonucleotide capture probes was examined using a panel of LNA 
mismatch oligonucleotides together with the DNA controls. As demonstrated in Figures 36 a and 
36c, the fluorescence intensities obtained with the LNA-modified 40-mer triple mismatch 
oligonucleotides show a 3-fold intensity decrease relative to the perfectly matched duplexes. In 

10 contrast, the corresponding 40-mer standard DNA capture probes are neither capable of forming 
stable duplexes nor discriminating between the perfect match and mismatched targets under 
standard hybridization stringency conditions, resulting in low intensity values from all 40-mer 
DNA capture probes (Figures 36a and 36c). Interestingly, mismatch discrimination with the 50- 
mer LNA probes could be significantly improved by increasing the hybridization temperature 

15 from 65 °C to 70 °C (Figures 36b and 36d), without compromising their capture sensitivity. By 
comparison, the signal intensities from all 50-mer DNA capture probes including the perfect 
match oligonucleotides were reduced under the same conditions (Figures 36b and 36d). 
Considered together, our results strongly support the contention that LNA oligonucleotide 
capture probes are significantly more sensitive and specific than DNA probes, being able to 

20 discriminate between highly homologous (90 %) mRNAs with a 5 to 10-fold increase in 
sensitivity. 

In a typical cell, mRNAs can be subdivided into three kinetic classes: (i) highly abundant 
(30-90 % of the total mRNA mass, 0.1 % of the sequence complexity); (ii) medium abundant (50 
% mass, 2-5 % of complexity); and (iii) low-abundant mRNAs (<1% mass, >90 % of 

25 complexity). In addition, alternative splicing has been shown to be prevalent in higher 
eukaryotes, where at least 50 % of the genes appear to be alternatively spliced, thereby 
generating additional diversity within the transcriptome. It is thus of utmost importance that the 
dynamic range, sensitivity and specificity of the expression profiling technology used are 
optimal, especially when analyzing expression levels of messages and mRNA splice variants 

30 belonging to the low-abundant class of high mRNA sequence complexity. A common problem 
for all DNA oligonucleotide microarrays is the need for an adequate compromise with respect to 
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the sensitivity and specificity of the platform. In the present example LNA oligonucleotide 
microarrays perform better in expression profiling than microarrays with corresponding DNA 
probes. Our results clearly demonstrate that both the specificity and sensitivity in target molecule 
capture can be improved using LNA oligonucleotide microarrays, enabling discrimination 
5 between highly homologous mRNAs and alternative splice variants with a simultaneous increase 
in sensitivity. 

Figures 36a-36d shows the sensitivity and specificity of LNA oligonucleotide capture 
probes (black bars) compared to DNA capture probes (white bars) on expression microarrays. 
Fluorescence intensity is shown in arbitrary units (relative measurements). The arrays 

10 comprising 50-mer and 40-mer perfect match and 1-5 mismatch capture probes were hybridized 
at eS^'C in 3xSSC with Cy3-labelled cDNA from 10 |ig C elegans total RNA spiked with yeast 
a) SWI5 RNA and c) THI4 RNA. Demonstration of improved mismatch discrimination with the 
50-mer LNA probes by increasing the hybridization temperature fi'om 65 °C to 70 °C hybridized 
with Cy3-labelled cDNA fi-om 10 |iig C elegans total RNA spiked with yeast b) SWI5 RNA and 

15 d)THI4RNA. 

Example 51. Improved sensitivity in the on-chip capture of yeast HSP78 mRNA using 
LNA-substituted 25-mer oligonucleotide capture probes. 

A. Capture probe design 

20 Unique capture probes for the yeast HSP78 gene were designed using the OligoDesign 

software, described in Figs. 19A-19F. The design options used were: (i) length of each 
oligonucleotide probe was 25 nucleotides; (ii) Blast word length 7; (iii) Blast expectation cut-off 
1000; other options were as default; 24 DNA capture probes were selected. Furthermore, three 
different LNA-substituted probes were designed based on the sequences of the 24 DNA capture 

25 probes, selected by OligoDesign: optimal LNA_T, optimal LNA_TC and LNA_3. In the LNA_T 
design the DNA t nucleotides were substituted with LNA T. For the LNA TC design, LNA T 
and C nucleotides were used to substitute DNA t and c. In the LNA_3 design every third DNA 
nucleotide was substituted with the corresponding LNA nucleotide. For the LNA T and 
LNA TC design, no blocks of LNAs were allowed; in addition the LNAs were substituted in a 

30 pattern providing a more narrow duplex melting temperature range compared to the DNA Tm 
range. In addition, an equivalent set of capture probes with a single mismatch in the central 
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nucleotide position was designed. Altogether, 192 capture probes were designed including a 
anthraquinone (AQ29) 5 '-modifier and a hexaethyleneglycol dimer (HEG2) at the 5 'end of each 
probe - as shown in Table 13. 

B. Determination of the duplex melting temperatures (Tm). 
5 The duplex melting temperatures of the DNA, LNA T and LNA TC designed probes 

were measured using the Perkin Elmer Lambda 40 Spectrophotometer and according to 
Wahlestedt et al. PNAS 97/10 2000. All oligos were measured twice and if the replica values 
deviated more than 1*^C, then a third or a fourth measurement was carried out. The average Tm 
for each oligonucleotide duplex is presented in Table 14, 

10 C. /« vitro synthesis of fluorochrome-labeled yeast HSP78 RNA 

CI. Genomic DNA was prepared from a wild type standard laboratory strain of 
Saccharomyces cerevisiae using the Nucleon MiY DNA extraction kit (Amersham Biosciences) 
according to supplier's instructions. 

CI. PGR amplification of the partial yeast gene was done by standard PGR using yeast 

15 genomic DNA as template. In the first step of amplification, a forward primer containing a 

restriction enzyme site and a reverse primer containing a universal linker sequence were used. In 
this step 20 bp was added to the 3 '-end of the amplicon, next to the stop codon. In the second 
step of amplification, the reverse primer was exchanged with a nested primer containing a poly- 
T20 tail and a restriction enzyme site. The SWI5 amplicon contains 730 bp of the SWI5 ORF plus 

20 20 bp universal linker sequence and a poly-A2o tail. 
The PGR primers used were: 

YDR258G-For-SacI: acgtgagctcttttgacatgtcagaatttcaag (SEQ ID NO: ) 
YDR258G-Rev-Uni: gatccccgggaattgccatgttacttttcagcttcctcttcaac (SEQ ID NO: ) 
Uni-polyT-BamHI: acgtggatccttttttttttttttttttttgatccccgggaattgccatg, (SEQ ID NO: ) 
25 C3. Plasmid DNA constructs 

The PGR amplicon was cut with the restriction enzymes, £coRI + BamHl, The DNA 
fi-agment was ligated into the pTRIamplS vector (Ambion) using the Quick Ligation Kit (New 
England Biolabs) according to the supplier's instructions and transformed into E, coli DH-5a by 
standard methods. 
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C4. DNA sequencing 

To verify the cloning of the PGR amplicon, plasmid DNA was sequenced using Ml 3 
forward and Ml 3 reverse primers and analysed on an ABI 377. 
C5. Biotin labeling of cRNA 
5 One ^g of plasmid containing the HSP78 sequence was linearized with restriction 

enzyme BamHI (Amersham Pharmacia Biotech, USA) for 2 hours at 37 °C. The RNA was 
labeled with biotin-CTP and biotin-UTP using the Message AMP aRNA kit from Ambion (USA) 
according to the manufacturer's instructions. Following hybridisation, the slides were stained 
with Streptavidin Phycoerythrin (Molecular Probes, S-866, USA) according to the GeneChip 

10 Expression Analysis Technical Manual (Affymetrix, USA) 
C6. Fluorochrome labeling of spike RNA 

In vitro synthesized spike RNA from the HSP78 plasmid construct was labeled with 
either Gy3-ULS or Gy5-ULS (Amersham Biosciences, USA) according to the manufacturer's 
instructions, followed by filtration through a ProbeQuant G50 Micro Golumn and Microcon30 

1 5 (Millipore, USA). The labeling efficiency was monitored using the Nanodrop spectrophotometer 
(Nanodrop Technologies, USA) 

D. Microarray fabrication 

The microarrays were printed on Immobilizer™ MicroArray Slides (Exiqon, Denmark) 
using the MicroGrid II from Biorobotics (UK) using a 20 [xM capture probe solution for each 
20 oligonucleotide probe. Four replicas of each capture probe were printed on the slides. 

E. Hybridization with fluorochrome-labelled cRNA 

The arrays were hybridized for 16 hours using the following protocol. The labelled RNA 
samples were hybridized in a hybridization solution (20 \iL final volume) containing 3xSSC 
(final concentration), 25 mM HEPES, pH 7.0 (final concentration), 1.25 |ig/|iiL yeast tRNA, 

25 0.3% SDS. The labeled RNA target sample was filtered in a Millipore 0.22 micron spin column 
according to the manufacturer's instructions (Millipore, USA), and the probe was denatured by 
incubating the reaction at lOOX for 2 min. The sample was cooled at 20-25°C for 5 min. by 
spinning at max speed in a microcentrifiige. A LifterSlip (Erie Scientific Company, USA) was 
carefiilly placed on top of the microarray spotted on Immobilizer™ MicroArray Slide and the 

30 hybridization mixture was applied to the array from the side. An aliquot of 30 |LlL of 3xSSC was 
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added to both ends of the hybridization chamber, and the Immobilizer™ MicroArray Slide was 
placed in the hybridization chamber. The chamber was sealed watertight and incubated at 45^C, 
55X or 65°C for 16-18 hours submerged in a water bath. After hybridisation, the slide was 
removed carefully from the hybridization chamber and washed using the following protocol. The 
Lifterslip coverslip was washed off in 6xSSC, pH 7.0 containing 0.1% Tween20 at 50°C for 15 
min., followed by washing of the microarrays in 0.4xSSC, pH 7.0 at 50°C for 30 min. Finally the 
slides were washed for 5 seconds in O.OSxSSC, pH 7.0. The slides were then dried by 
centrifugation in a swinging bucket rotor at approximately 200 G for 2 min. 

F. Data analysis. 

Following washing and drying, the slides were scanned using a ScanArray 4000XL 
scanner (Perkin-Elmer Life Sciences, USA), and the array data were processed using the 
GenePix™ Pro 4.0 software package (Axon, USA). 

G. RESULTS 

Gl. Duplex melting temperatures 

The Tm data clearly shows that LNA-substituted oligonucleotide capture probes have a 
significantly increased average duplex melting temperature compared to the corresponding DNA 
probes. Furthermore, the difference in melting temperature between the perfectly matched (PM) 
and single mismatched (MM) probes, designated as ATm, is significantly higher than the 
corresponding ATm for DNA probes (Table 12). 
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Table 12. The average difference in melting temperature between the perfectly matched (PM) 
and single mismatched (MM) probes in different capture probe designs. The observed difference 
between the DNA and LNA substituted probes is statistically significant as revealed by a t-Test; 
Two-Sample Assuming Unequal Variances. 



Average 

Max 

Min 

ATm 

Stdev. 

T-Test 

MM-DNA 

60.4 

68.5 

54.3 




MM- LNA T 

67.6 

74 

58.1 




MM-LNA TC 

72.0 

79.3 

61.6 




PM-DNA 

66.3 

72.8 

61 

5.9 

1.52 


PM-LNA T 

74.2 

80.8 

66.3 

6.6 

1.32 

0.047 

PM-LNA TC 

80.0 

86.8 

71.4 

8.0 

2.65 

0.001/0.017 


5 

G2. Microarray hybridization results 

Both LNA_T and LNA_3 substituted 25-mer probes are capable of providing highly 
accurate measurements for fold-of-changes in gene expression levels, as depicted in Figure 37. 
The DNA capture probes did not provide any hybridisation signals under the given microarray 

10 hybridisation conditions (Fig. 37). Figure 37 shows the expected (black bars) and observed 

(white bars) fold-of-change in the expression levels of the Cy3-ULS labelled HSP78 spike RNA 
as measured by on-chip capture using different oligonucleotide capture probes. In the 
hybridization experiment, 1 ng HSP78 in vitro spike RNA or 200 pg HSP78 in vitro spike RNA 
was used, respectively. Thus, the fold change of the HSP78 RNA in the two hybridizations in the 

15 comparison is 5-fold. Fourteen additional synthetic in vitro mRNA spike controls were included 
in the hybridisation solution as a semi-complex background RNA mixture. Seven of these spikes 
were used as normalization controls, the other seven were used as negative controls. 
Hybridization temperature was 65°C for 16 hours, and post-hybridization washes as described 
above. Under these conditions the DNA capture probes did not produce hybridization signals. 

20 Figure 38 shows measured intensity levels by on-chip capture using three different 25-mer 

oligonucleotide capture probe designs. One (1) ng biotin-labeled HSP78 target was used in the 
hybridization experiments, followed by staining with Streptavidin Phycoerythrin. The LNA T 
and LNA_TC substituted 25-mer capture probes show a significantly enhanced on-chip capture 
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of the HSP78 RNA target, compared to the DNA 25-mer control probes under four different 
hybridization stringency conditions. 

Table 13. Design of the yeast HSP78 capture probes. YDR258C denotes the ORF name of the 5. 
5 cerevisiae HSP78 gene. The numbers refer to the nucleotide position from the 3 '-end of the 
HSP78 mRNA sequence. PM = perfectly matched probe, MM = single mismatch probe, LNA 
substitutions are depicted by capital letters, "^C denotes LNA methyl-C (SEQ ID NOs: ) 


Oligo name 


Sequence 

Oligo name 



Sequence 

YDR258C_PM_ 

.043 

tttggtagcacgacaagcttagtat 

YDR258C_PM_ 

043T 


TTTggTagcacgacaagcTTagTat 

YDR258C_PM_ 

078 

cactctaacagtttcgccgtttcta 

YDR258C_PM_ 

078T 


cacTcTaacagtTtcgccg i 11 cTa 

YDR258C_PM_ 

124 

gttgccatggagttcaaaatctgtc 

YDR258C_PM_ 

_124T 


gTTgccaTggagTtcaaaaTcTgTc 

YDR258C_PM_ 

164 

tcaatggccttgcaccatataattg 

YDR258C_PM_ 

_164T 


TcaaTggccTTgcaccaTaTaaTTg 

YDR258C_PM_ 

_201 

tcagttagccaatccttcgcttcat . 

YDR258C_PM_ 

_201T 


TcagTTagccaaTccTTcgcTTcat 

YDR258C_PM_ 

.249 

tttttcggccaaacgatcttgaatt 

YDR258C^PM_ 

_249T 


TtTTTcggccaaacgaTcTTgaaTt 

YDR258C_PM_ 

_295 

attgacctcaaaactttcttggata 

YDR258C_PM_ 

_295T 


aTTgaccTcaaaacTTTcTTggaTa 

YDR258C_PM_ 

.356 

gatgaactcaggtggataggatctt 

YDR258C_PM_ 

_356T 


gaTgaacTcaggXggaTaggaTcTt 

YDR258C_PM_ 

_424 

accatcatcacccaactttgtgtcg 

YDR258C_PM_ 

_424T 


accaTcaTcacccaacTTTgTgTcg 

YDR258C_PM_ 

433 

cccaactttgtgtcgtttaataaaa 

YDR258C._PM_ 

_433T 


cccaacTTTgTgTcgTTTaaTaaaa 

YDR258C_PM_ 

_486 

caatgatcgtgttacggaaatcaac 

YDR258C_PM_ 

486T 


caaTgaTcgTgtTacggaaaTcaac 

YDR258C_PM_ 

_515 

tggcctagggaatcggtcagcttac 

YDR258C_PM_ 

_515T 


TggccTagggaaTcggTcagcTTac 

YDR258C_PM_ 

_566 

ttggaaacatcggggtgcgcttttt 

YDR258C_PM_ 

_566T 


TTggaaacaTcggggTgcgcTTTTt 

YDR258C_PM_ 

_569 

gaaacatcggggtgcgctttttcaa 

YDR258C_PM_ 

_569T 


gaaacaTcggggTgcgcTTTTTcaa 

YDR258C_PM. 

_604 

aaaacgacagcataaggctttcttc 

YDR258C_PM 

604T 


aaaacgacagcaTaaggcTTTcTTc 

YDR258C_PM. 

.631 

gacagcctcagttaattggccacca 

YDR258C_PM_ 

_631T 


gacagccTcagtTaaTTggccacca 

YDR258C_PM_ 

_686 

ccgattaaacgagagacagtatgct 

YDR258C..PM_ 

_686T 


ccgaTTaaacgagagacagTaTgct 

YDR258C_PM_ 

151 

atcaaataggaattcagctaaagcc 

YDR258CPM_ 

151T 


aTcaaaTaggaaTTcagcTaaagcc 

YDR258C_PM_ 

_813 

gacctaagaacataaagctggcaat 

YDR258C_PM_ 

mi 


gaccTaagaacaTaaagcTggcaat 

YDR258C_PM. 

_823 

ataaagctggcaataggtctctttt 

YDR258C_PM. 

_823T 


aTaaagcTggcaaTaggTcTcTTTt 

YDR258C__PM_ 

_870 

agacgtacagcatcagaaatagcag 

YDR258C_PM, 

_870T 


agacgTacagcaTcagaaaTagcag 

YDR258C_PM. 

_888 

aaatagcagcaatggcctcgtcttg 

YDR258C_PM. 

_888T 


aaaTagcagcaaTggccTcgTcTTg 

YDR258C_PM. 

_890 

atagcagcaatggcctcgtcttggc 

YDR258C_PM, 

_890T 


aTagcagcaaTggccTcgTcTTggc 

YDR258C_PM. 

_896 

gcaatggcctcgtcttggccaacga 

YDR258C_PM. 

_896T 


gcaaTggccTcgTcTTggccaacga 

YDR258C_PM_043 

TtTggTagmCamCgamCaagmCtT 





TC 


agTat 

YDR258C_PM. 

_LNA3_ 

_043 

TttGgtAgcAcgAcaAgcTtaGtat 

YDR258C_PM. 

_078 

mCamCtmCtaamCagtTtcgmCcgT 

YDR258C^PM. 

_LNA3_ 

_078 

mCacTctAacAgtTtcGccGttTcta 
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Oligo name 

Sequence 

Oligo name 



Sequence 

I U 

1 1 mv^ 1 a 





Y UK^ joC_r IM_ 1 ^4 

g 1 tgmumca i ggag i tmcaaaa i m 






CtgTc 

Y L/KZ J ol^_r JYl_ 

I Xl A ^ 

_LlMAJ_ 


unoccAtguag i icAaaAtc i gtc 

YDR2 5 oC_r M_ 1 o4 

1 mCaa 1 ggmCc 1 1 gmCacmCa 1 a 





1 U 

TaaTTg 

Vr*D0<9l^ D\>l 

y Ulv£DoL'_r M_ 


1 HA 
I04 

1 caAtgocc I igmuacmcatAtaAttg 

YDR23oC_r M_20 1 

I mCag I tagcmCaa 1 cmCt 1 mCgm 





1 

CttmCat 

Y UKZJoC_rM_ 

1 XT A 0 

_4z4 

AccAtcAtcAccmCaamCttTgtGtcg 

YDR2 5 8C_PM_249 

TtTtTmCggmCmCaaamCgatmCt 





rc 

TgaaTt 

YDR258C_PM_ 

1 XT A 1 

_LNA3_ 

486 

mCaa I ga 1 eg 1 gt I acOgaAatmCaac 

YDR258C__PM_295 

aTTgamCmCTmCaaaamCTTtmC 






TTggaTa 

Y UK2DoC_r 1W_ 

1 XI A ? 

C 1 c 
DID 

TggmCctAggGaaTcgGtcAgcTtac 

YDR2 58C_PM_356 

gaTgaamCTmCaggTggaTaggaT 







YUR25oC_PM_ 

1 XT A 1 

_LNA3_ 

am 
_5oo 

1 tguaaAca l cgOgg J gcOct I ttt 

YDk2 5 8C_PM_424 

amCcaTcaTcacmCmCaaccTtgTgt 





TC 

mCg 

YDR258C,PM. 

_LNA3_ 

5 69 

OaaAca 1 cgOgg I gcOct I tt 1 caa 

YDR258C_PM_433 

mCmCmCaacTTtgTgTcgTTTaaT 





TC 

aaaa 

YDR258C^PM_ 

_LNA3_ 

604 

AaaAcgAcaOca 1 aaOgc 1 ttmCttc 

YDR2 5 8C_PM_486 

mCaaTgaTmCgTgtTamCggaaaT 





TC 

mCaac 

YDR258C_PM_LNA3_ 

_757 

AtcAaaTagGaaTtcAgcTaaAgcc 

YDR258C_PM_5 1 5 

TggmCcTagggaaTmCggTcagmCt 





TC 

Tac 

YDR258C_PM_ 

_LNA3. 

oil 

_813 

GacmCtaAgaAcaTaaAgcTggmCaat 

YDR258C_PM_566 

TTggaaamCatmCggggTgmCgctT 





TC 

tTt 

YDR258C_PM_ 

_LNA3_ 

_823 

AtaAagmCtgGcaAtaGgtmCtcTttt 

YDR2 58C_PM_569 

gaaamCaTmCggggTgmCgctttTtm 





IC 

Caa 

YDR258C^PM_ 

_LNA3. 

OTA 
_0 /U 

AgamCgtAcaGcaTcaGaaAtaGcag 

YDR258C_PM_604 

aaaamCgamCagmCaTaaggmCTt 




AaaTagmCagmCaa 1 ggmCctmCgtmCt 

IC 

TmCtTc 

YDR258C_PM__LNA3. 

QQQ 
OOO 


YDR258C_PM_63 1 

gamCagmCcTcagtTaaTTggcmCa 





IC 

cmCa 

YDR258C_PM_ 

_LNA3. 

_o9U 

A A A A A A AAT'AnT'A«''T' AiAA 

AtaucaucaAtgocc i eg i ct i gge 

YDR258C_PM_686 

mCmCgaTTaaamCgagaganiCagT 





1 C 

aTgmCt 

YDR258C^PM_ 

_LNA3. 

_o9o 

GcaAtgGccTegTctTggmCcaAcga 

YDR258C_PM_757 

aTmCaaaTaggaaTtmCagmCTaaa 





1 C 

gmCc 

YDR258C_PM. 

_LNA3. 

it's 1 

_631 

OacAgcmCte Agt I aa i tgOee Aeca 

x/i~\"n ^ o/^ i>x if oil 

YDR258C_PM_8 1 3 

gamCmCTaagaamCaTaaagmCTg 





TC 

gmCaat 

YDR258C_PM. 

_LNA3. 

686 

mCcgAttAaamCgaGagAcaGtaTgct 

YDR258C_PM_823 

aTaaagmCTggmCaaTaggTmCTc 





TC 

TTTt 

YDR258C_PM. 

_LNA3. 

_356 

GatGaamCtcAggTggAtaGgaTctt 

YDR258C_PM_870 

agamCgTamCagmCaTmCagaaaT 





TC 

agmCag 

YDR258C_PM, 

_LNA3. 

_201 

TeaGttAgcmCaaTccTtcGctTcat 
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Oligo name 


Sequence 

Oligo name 

Sequence 

YDK25oC_rivl_ 


aaa 1 agmCagmCaaTggmCcTmCg 



TC 


TctTg 

YDR258C^PM^LNA3_249 

TttTtcGgcmCaaAcgAtcTtgAatt 

YDR258C_PM_ 

890 

aTagmCagcaaTggmCcTcgtmCtT 



TC 



YDR258C PM LNA3 295 

AttGacmCtcAaaActTtcTteGata 

YDR258C PM 

896 




TC 


gcaatggcctmCgTmCttggccaacga 

YDR258C PM_LNA3_433 

mCccAacTttGtgTcgTttAatAaaa 

YDR258C MM 

043 

tttggtagcacgtcaagcttagtat 

YDR258C MM 043T 

TTTeeTaecacgtcaaecTTaeTat 

YDR258C MM 

078 

cactctaacagtatcgccgtttcta 

YDR258C MM 078T 

cacTcTaacagtatcgccgTTTcTa 

YDR258C MM 

_124 

&tt&ccat&&a&atcaaaatct&tc 

YDR258C MM 124T 

fiTTeccaTeeaeatcaaaaTcTeTc 

YDR258C MM 

164 

tcaatggccttggaccatataattg 

YDR258C MM 164T 

TcaaTggccTTggaccaTaTaaTTg 

YDR258C_MM_ 

_201 

tcagttagccaaaccttcgcttcat 

YDR258C_MM_20 1 T 

TcagTTagccaaaccTTcgcTTcat 

YDR258C MM 

249 

tttttcggccaatcgatcttgaatt 

YDR258C_MM_249T 

TtTTTcggccaatcgaTcTTgaaTt 

YDR258C_MM_ 

_295 

attgacctcaaatctttcttggata 

YDR258C_MM_295T 

aTTgaccTcaaatcTTTcTTggaTa 

YDR258C MM 

356 

2atBaactca&&a&&atafi&atctt 

YDR258C MM 356T 

saTsaacTca^EacEaTaEsaTcTt 

YDR258C MM 

424 

accatcatcaccgaactttgtgtcg 

YDR258C MM 424T 

accaTc aTc ac cgaacTTTgTgTcg 

YDR258C MM 

433 

cccaactttgtgacgtttaataaaa 

YDR258C MM 433T 

cccaacTTTgTgacgTTTaaTaaaa 

YDR258C MM 

486 

caatgatcgtgtaacggaaatcaac 

YDR258C MM 486T 

caaTgaTc gTgtaac ggaaaTc aac 

YDR258C MM 

515 

teecctaeesaaace&tcaecttac 

YDR258C MM 515T 

TeeccTae22aaacBeTca2cTTac 

YDR258C MM 

566 

tteeaaacatceceptffcecttttt 

YDR258C MM 566T 

TTeeaaacaTcscceTecficTTTTt 

YDR258C MM 

569 

saaacatc e e e ea ec ^ctttttcaa 

YDR258C MM 569T 

paaacaTcppppapcpc 1 I'TTTcaa 

YDR258C MM 

604 

aaaacgacagcaaaaggctttcttc 

YDR258C MM 604T 

aaaacgacagcaaaaggcTTTcTTc 

YDR258C MM 

631 

gacagcctcagtaaattggccacca 

YDR258C MM 63 IT 

gacagccTcagtaaaTTggccacca 

YDR258C MM 

686 

ccgattaaacgacagacagtatgct 

YDR258C MM 686T 

ccgaTH'aaacgacagacagTaTgct 

YDR258C MM 

757 

ate aaataggaaatc agctaaagcc 

YDR258C_MM_757T 

aTc aaaTaggaaatcagcTaaagcc 

YDR258C_MM_ 

_813 

sacctaacaacaaaaasctcecaat 

YDR258C_MM_81 3T 

gaccTaagaacaaaaagcTggcaat 

YDR258C_MM. 

_823 

ataaagctggcattaggtctctttt 

YDR258C_MM_823T 

aTaaagcTggcaaaaggTcTcTTTt 

YDR258C_MM. 

_870 

agacgtacagcaacagaaatagcag 

YDR258C_MM_870T 

agacgTacagcaacagaaaTagcag 

YDR258C_MM. 

_888 

aaatagcagcaaaggcctcgtcttg 

YDR258C MM 888T 

aaaTagc a gc aa aggcc Tc gTcTTg 

YDR258C_MM. 

_890 

atagcagc aatgccctc gtcttggc 

YDR258C_MM_890T 

aTagcagcaaTgcccTcgTcTTggc 

YDR258C_MM, 

_896 

gcaatggcctcgacttggccaacga 

YDR258C^MM_896T 

gcaaTggccTcgacTTggccaacga 

YDR258C_MM_043 TtTggTagmCamCgtcaagmCtTag 



TC 


Tat 

YDR258C_MM^LNA3_043 TttGgtAgcAcgTcaAgcTtaGtat 

YDR258C_MM_078 mCamCtmCtaamCagtatcgmCcgT 



TC 


tTmCTa 

YDR258C_MM_LNA3_078 mCacTctAacAgtAtcGccGttTcta 

YDR258C_M M_l 24 gTtgmCmCaTggagatmCaaaaTmC 



TC 


tgTc 

YDR258C_MM_LNA3_1 24 GttGccAtgGagAtcAaaAtcTgtc 
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Oligo name Sequence 


Oligo iiame 


Sequence 


YDR258C_MM_164 TmCaaTggmCcTTggacmCaTaTa 
TC aTTg 

YDR2 58C_M M_20 1 tmCagTtagcmCaaacctTcgmCttm 
TC Cat 

YDR258C_MM_249 TtTtTmCggmCmCaatcgatmCtTg 
TC aaTt 

YDR258C_MM_295 aTTgamCmCTmCaaatcTTtmCTT 
TC ggaTa 

YDR258C_MM_356 gaTgaamCTmCaggaggaTaggaTm 
TC CTt 
YDR258C_MM_424 

TC amCcaTcaTcacccaaccTtgTgtmCg 
YDR258C_MM_433 itiCmCmCaacTTtgTgacgTTTaaT 

TC aaaa 

YDR258C_MM_486 mCaaTgaTmCgTgttamCggaaaTm 
TC Caac 

YDR258C_MM_5 1 5 TggmCcTagggaaacggTcagmCtTa 
TC c 

YDR258C_MM_566 TTggaaamCatmCgcggTgmCgctTt 
TC Tt 

YDR258C_MM_569 gaaamCaTmCggggagmCgctttTtm 

TC Caa 

YDR258C_MM_604 aaaamCgamCagmCaaaaggmCTtT 
TC mCtTc 

YDR258C_1VIM_63 1 gamCagmCcTcagtaaaTTggcmCa 
TC cmCa 

YDR258C_MM_686 mCinCgaTTaaaniCgacagamCagT 
TC aTgmCt 

YDR258C_MM_757 aTmCaaaTaggaaatmCagmCTaaag 
TC mCc 

YDR258C_MM_813 gamCmCTaagaamCaaaaagmCTg 
TC gmCaat 

YDR258C_M M_823 aTaaagmCTggmCattaggTmCTcT 
TC TTt 

YDR258C_MM_870 agamCgTamCagmCaacagaaaTag 
TC mCag 

YDR258C_MM_888 aaaTagmCagmCaaaggmCcTmCg 
TC TctTg 

YDR258C_MM_890 aTagmCagcaaTgcccTcgtmCtTggc 


YDR258C__MM_LNA3_1 64 TcaAtgGccTtgGacmCatAtaAttg 

YDR258C_MM_LNA3_201 TcaGttAgcmCaaAccTtcGctTcat 

YDR258C_MM_LNA3_249 TttTtcGgcmCaaTcgAtcTtgAatt 

YDR258C_MM_LNA3_295 AttGacmCtcAaaTctTtcTtgGata 

YDR258C_MM_LNA3_356 GatGaamCtcAggAggAtaGgaTctt 

YDR258C_MM_LNA3_424 AccAtcAtcAccGaamCttTgtGtcg 

YDR258C__MM_LNA3_433 mCccAacTttGtgAcgTttAatAaaa 

YDR258C_MM_LNA3_486 mCaaTgaTcgTgtAacGgaAatmCaac 

YDR258C_MM_LNA3_5 1 5 TggmCct AggOaaAcgOtc AgcTtac 

YDR258C_MM_LNA3_566 TtgGaaAcaTcgmCggTgcGctTttt 

YDR258C_MM_LNA3_569 GaaAcaTcgGggAgcGctTttTcaa 

YDR258C_MM_LNA3_604 AaaAcgAcaGcaAaaGgcTttmCttc 

YDR258C_MM_LNA3_63 1 GacAgcmCtc AgtAaaTtgGccAcca 

YDR258C_MM_LNA3__686 mCcgAttAaamCgamCagAcaGtaTgct 

YDR258C_MM_LNA3_757 AtcAaaTagGaaAtcAgcTaaAgcc 

YDR258C_MM_LNA3_8 1 3 GacmCtaAgaAcaAaaAgcTggmCaat 

YDR258C_MM_LNA3_823 AtaAagmCtgGcaTtaGgtmCtcTttt 

YDR258C__MM_LNA3_870 AgamCgtAcaGcaAcaGaaAtaGcag 

AaaTagmCagmCaaAggmCctmCgtmCt 
YDR258C_MM_LNA3_888 tg 

YDR258C_MM_LNA3_890 AtaGcaGcaAtgmCccTcgTctTggc 
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Oligo name Sequence 

Oligo name Sequence 

TC 

YDR258C_MM_896 gmCaaTggcctmCgactggccaainCg 
TC a 

YDR258C_M M_LN A3_896 GcaAtgGccTcgActTggmCcaAcga 


Table 14. Duplex melting temperatures (Tm) for the 144 different 25-mer oligonucleotide 
capture probes. The design column denotes the sequence design of the probe. PM = perfectly 
matched probe, MM = single mismatch probe, LNA substitutions are depicted by capital letters, 
5 "^C denotes LNA methyl-C (SEQ ID NOs: ) 


Probe 
number 

Oligonucleotide target name 

Complementary target sequence 

Design 

Average 
duplex 
melting 
temp. (X) 

12696 

YDR258C_Tm_predic_043 

tttggtagcacgtcaagcttagtat 

MM-DNA 

59-2 

12699 

YDR258C_Tm_predic_078 

cactctaacagtatcgccgtttcta 

MM-DNA 

61,1 

12694 

YDR258C_Tm_predic_l 24 

gttgccatggagatcaaaatctgtc 

MM-DNA 

60 

12700 

yDR258C_Tm_predic_l 64 

tcaatggccttggaccatataattg 

MM-DNA 

59.5 

12693 

YDR258C_Tm_predic_201 

tcagttagccaaaccttcgcttcat 

MM-DNA 

61.4 

12698 

YDR258C_Tm_predic_249 

tttttcggccaatcgatcttgaatt 

MM-DNA 

58.8 

12702 

YDR258C__Tm_predic_295 

attgacctcaaatctttcttggata 

MM-DNA 

54.5 

12692 

YDR258C_Tni_predic_356 

gatgaactcaggaggataggatctt 

MM-DNA 

58.2 

12680 

YDR258C_Tm_predic_424 

accatcatcaccgaactttgtgtcg 

MM-DNA 

63.6 

12703 

YDR258C_Tni_predic_433 

cccaactttgtgacgtttaataaaa 

MM-DNA 

56.2 

12681 

YDR258C_Tm_predic_486 

caatgatcgtgtaacggaaatcaac 

MM-DNA 

59.3 

12682 

YDR258C_Tm_predic_5 1 5 

tggcctagggaaacggtcagcttac 

MM-DNA 

65.2 

12683 

YDR258C_Tm_predic_566 

ttggaaacatcgcggtgcgcttttt 

MM-DNA 

61.5 

12684 

YDR258C_Tmj5redic_569 

gaaacatcggggagcgctttttcaa 

MM-DNA 

63.8 

12701 

YDR258C_Tm__predic_604 

aaaacgacagcaaaaggctttcttc 

MM-DNA 

58.8 

12685 

YDR258C_Tm_predic_63 1 

gacagcctcagtaaattggccacca 

MM-DNA 

64.8 

12686 

YDR258C_Tm_predic_686 

ccgattaaacgacagacagtatgct 

MM-DNA 

57.1 

12687 

YDR258C_Tm_predic_757 

atcaaataggaaatcagctaaagcc 

MM-DNA 

54.3 

12688 

YDR258C_Tin_predic_8 1 3 

gacctaagaacaaaaagctggcaat 

MM-DNA 

58.2 

12697 

YDR258C_Tm_predic_823 

ataaagctggcattaggtctctttt 

MM-DNA 

58.4 

12695 

YDR258C_Tm_predic_870 

agacgtacagcaacagaaatagcag 

MM-DNA 

61.2 

12689 

YDR258C_Tmj)redic_888 

aaatagcagcaaaggcctc gtcttg 

MM-DNA 

64 

12690 

YDR258C_Tm_predic_.890 

atagcagcaatgccctcgtcnggc 

MM-DNA 

62.9 

12691 

YDR258C_Tm_j)redic_896 

gcaatggcctcgacttggccaacga 

MM-DNA 

68.5 
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1 1 UUt? 
1 lUI IIUKSI 

V^ilUiJI lUUICVJliUC ICIi^^^ MCIIIIC? 

Pnmnl^impntarv/ tijrnpt QpniiPnpp 
oui 1 ipioi i ici iioi y idi^d ocv^ucsi IV/C7 

i-Zcoiy 1 1 

Hi inlpy 

mpitinn 

1 1 Idlll lU 

12720 

YnR9S8r Tm nredic 04'?T 

TTTooTpor arotr an or TTpoTat 

MM-T 

68 6 


YnR9S8r Tm nrpdir 078T 

I L/IVZ^OV.^ 1 III piCUIL' \3 1 O i 

rJirTrTu uruotntr orr oTTTrTsi 
Lav 1 C I aaL<aKldlCgCCg 1 1 1 C 1 a 

MM-T 

IVliVl- 1 

60 7 

1 ^ / I o 

YnR9S8r Tm nrpHtr 1 94T 

p-p<-p i-p ciCk\r*CkCi Q iiTf*ToTr 
^11 ^CWa J ^^a^aVy^aaatX 1 L> 1 g 1 W 

MM.T 

IVllVl* 1 

60 9 

1 9794 

I ^ /ZH 

YnR9SRP Tm nrf»Hir 1 64T 

Tr Q o Ter or r TTo oo r r a Tq Tq Q TT o 
1 Cda J ggCC 1 I ggaCCa 1 d 1 da 1 ■ g 

MM-T 

60 7 

19717 

YnR9S8P Tm nrprfir 901 T 
I L/ivzjo\^ 1 III picmc zui 1 

Tr Q oTTq o r r Q a Q rr TTr orTTr nt 
1 vdg 1 1 dg^i/daawi< 1 1 ('gC I 1 Vrdi 

MM-T 

IVIIVI* 1 

60 4 

1 9799 
1 z / zz 

YnR9S8P Tm nrpHir 940T 

I 1-/|\Z JOV-. 1 111 piCUlL ZH^ 1 

Tt 1 1 ' 1 V" fifrr'i^o ot/*nQT/* 11 oooTf 
1 L 1 1 1 LggLLaoLL^a 1 L 1 1 ^aa 1 I 

MM T 

iVllVl- 1 

6S 0 

19776 
1 z / 

I i-/rvzjov_^ 1 III uicuit^ 1 

Q TT o arr Tr a n a fr TTTr TT o o I* Ta 
all gavC 1 CadalV 1 1 1 c 1 1 1 d 

MM-T 

IVllVl- 1 

6S 1 

O J. 1 

1 9716 

YnR9SRr Tm nrf»Hir '^S6T 

J LyivZJOV_^ 1 111 piCUlv JJUl 

1 gddC I Cdggdggd 1 dggd 1 C 1 I 

MM-T 

1 VI IVl 1 

64 0 

1 9704 

YnR9S8P Tm nrpHir 494T 

I L/rvZJov^ 1 III piCUlC HZH 1 

aCCd 1 l^d 1 Cdi^LgddL 1 1 1 g 1 g 1 eg 

MM-T 

iVllVI- 1 

74 

1 9797 

1 Z / Z / 

YnR9S8r Tm nrpHir A'K'XT 

r»r»/^QQ/^ III oTftQr*o' III qqTq q a a 
UCvddu 1 1 1 g i gdCg ill dd i dddd 

MM-T 

f\f\ s 

OO. J 

197ns 

1 Z / UJ 

YTiR9SSP Tm nrpHir 486T 

I l^fVZJOV-._l 111 picuic ^T-OO 1 

raaTcraTroTcrtaar ocraaaTraar 
^dd 1 gd 1 L'g 1 giddCggddd 1 ^ddw 

MM-T 

6S 9 

19706 

1 Z /UO 

YnR9S8P Tm nrpHir SI ST 

I I.^I\Z^OV^ Jill piCUIC JlJl 

Toorr To oo era a ar ooTr a or TTar 
1 ggCC J agggdddCgg 1 CdgC 1 1 dC 

MM-T 

IVIIVI- I 

71 6 
/ 1 .D 

19707 
1 z / V / 

YnR9SRP Tm nrpHir S66T 

1 L/IvZ JOw__ I III piCUIL> JUO 1 

1 1 "rtfiQ aa/»oT/»0'/»ooT^<Tr#»orr*' l~l"l~l'l' 

1 * ggaaaca i cgcgg i gcgc i i i 1 1 

MM-T 

IVJIVI- I 

68 6 

1 9708 
iz /UO 

YnR9SRP Tm nrpHir S60T 

I \ji\jloo\^_ I iii_prcuic joy 1 

gaaaca i cggggagcgc i j 1 1 i caa 

MM-T 

IVllVl- 1 

60 0 

1 979S 
Iz /ZJ 

VnR9SSP Tm r»rpHir 604T 
I \Ji\jLjo\^_ 1 iJi__prcuic_ou*i 1 

aaaacgacagcaaaaggc i 1 1 c 1 1 c 

MM-T 

6S 0 

12709 

YDR258C_Tm_predic_63 IT 

gacagccTcagtaaaTTggccacca 

MM-T 

68.8 

ill \\J 

Y UKz-)oL._ 1 m_preaic_ooo i 

ccgaTTaaacgacagacagTaTgct 

IVllVl- I 


1 971 1 
1 Z / 1 1 

I ujvzDol-- 1 m prcQic ijii 

aTc aaaTaggaaatc agcTaaagcc 

\A\A T 
IVllVl- 1 

S8 1 
JO. 1 

1 771 7 
J Z / 1 Z 

I UKz3ov^_ 1 m prcQic o i j \ 

gaccTaagaacaaaaagcTggcaat 

\A\A T 
IVllVl- 1 

0 1 . 1 

19791 

1 Z /Z 1 

YnR9S8P Tm nrpHtr 89'^T 
T l^rvZ Jol^ 1 III piCUItf OZJ 1 

'*ToOOftr*Tftft/*OQOQ ftftT/^T/*' 1 1 ' 1 ♦ 

d 1 ddagC 1 ggCddddgg ICli/1 1 11 

\4\A T 

1 14-14 
1 j~ It 

67 9 
o / .z 

1 971 0 

I Z / 1 7 

YnR9S8P Tm r\T(^d\c 870T 
I \Js\£,joK^_ 1 iTi_prcuic_o / u 1 

(1 fto/» ftT*o <^ Q ft/* Q o Q fta o aHPo ft/*o ft 

dgacg J dcagcddcdgddd i dgcdg 

MM-T 

IVllVl- 1 

6S 

1 971 '\ 

I Z / 1 J 

YnR9S8P Tm nrpHir 888T 

I lyivz j0v^_ 1 m_prcoic ooo i 

a n n ' 1 'o ft/»<»rtr»<i€Sfaftft/«/**T'/»ftT/»TTft 

aaa i dgcdgcdadggcc i eg i c i i g 

MM-T 

lYllVI- I 

70 4 

1 9714 

1 Z / 1 1 

YnR9SRP Tm nrpHir 8Q0T 

qT^q rr^Q f»r»Q QTrrr o/*Tr»rtTr'l ~i oof 
a 1 dgCagCdd 1 gv.'VL 1 eg I e 1 1 ggCi 

MM-T 

IVllVl- 1 

71 

/ 1 . J 

1 971 S 

1 Z / 1 ^ 

YnR9S8P Tm nrpHir 8Q6T 
I LTvZJoV^ 1 III piCUIL 070 i 

gcaa J ggCC 1 CgdC l l ggCCddCgd 

MM-T 

IVliVl- 1 

7'^ 7 

1 9744 
1 z / 

YnR9S8P Tm nrpHir 04'^TP 

I L/JvZ JOV^ 1111 [JICUIL Wt J 1 

TtTrrrrTQ rttnP QtnP crt/^QQ rtTviPtTQ fiTat" 
111 gg 1 dglllV^dlllV.^gieddgllie^l 1 dg 1 dl 

MM -TP 

IVllVi- 1 

7"^ 

1 9747 

1 Z /*T / 

YnR9S8P Tm nrpHir 078TP 

rrtpQ rtiPf rYtPtoom^^Q rrf'if/^ftmP/'ftTl'TTTil'To 
lijV^dlllV^llllV.'lddlllV^dgldlCglllV^Cg 111 lllw 1 d 

MM -TP 

iVl i VI - 1 

7S 9 

/ J.Z 

19749 

1 Z /HZ 

YnR9S8P Tm nrpHir 194TP 

I l^IVZJOv^ 1 III piCUIC 1 ZH 1 Vrf 

oTtomPmPaTooaoatmPaanaTmPtoTr 

g J Igill^lllV^d 1 ggdgdllllOdddd 1 lll\^ig 1 \f 

MM -TP 

IVJ IVl - 1 

61 6 

1 z /*to 

YnR9SSP Tm nrpHir 164TP 

I 1-#I\.Z jO\^_ 1 III piCUIC 1 OH 1 \^ 

Tin P a a T o CTin Pr TTo o a rm P a Ta Ta a TT o 
1 lil\^dd 1 gglllv^C 1 I ggdCIIlv^d 1 d 1 dd 1 1 g 

MM -TP 

IVl IVl - 1 

74 8 

12741 

YDR258C_Tni__predic_,201TC 

tmCagTtagcmCaaacctTcgmCttniCat 

MM-TC 

70.6 

1 974A 
Iz O 

VnR9S8P Tm nr^Hir 940TP 

Tf TtTr« P ft ftT« P m P Q o t/» ft Q tm PtTft Q qT* 

J I J I J mv^ggrni-.iTii-^aaicgaiinL/i j gaa 1 1 

MM TP 

iVl 1 VI - 11^ 

71 
/ 1 

12750 

YDR258C Tm nredic 295TP 

aTTonmPmPTmPaaatrTTtmPTTotiaTa 
nil Koiii^'iiix^ 1 iiiv^aaciiw i i iiiiv/ 1 i i o 

MM-TC 

72.2 

12740 

YDR258C_Tmj3redic_356TC 

gaTgaamCTmCaggaggaTaggaTmCTt 

MM-TC 

70.4 

12728 

YDR258C_Tm_predic_424TC 

amCcaTcaTcacccaaccTtgTgtmCg 

MM-TC 
13+16 

70.2 

12751 

YDR258C_Tni_predic^433TC 

mCmCmCaacTTtgTgacgTTTaaTaaaa 

MM-TC 

67.6 
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Prob6 
numbGr 

Olioonunlpotidp taropt namp 

Vw^ll^wl 1 VlV/l^^ilVJ W ICII^wl IIOIIIw 

Comnlpmpntarv taropt «;pniipnnp 

Dpsinn 

Ax/pranp 
dunlex 

\J \Jt 1 v#\ 

melting 
temo f°C) 

12730 

YDR258C Tm_predic 515TC 

TppTnCcTapppaaacggTcapmCtTac 

MM-TC 

75.5 

12731 

YDR258C Tm oredic 566TC 

TTppaaamCatmCpcffpTpmCpctTtTt 

MM-TC 

72 

12732 

YDR258C Tm oredic 569TC 

paaamCaTmCffpffffaffmCffctttTtmCaa 

MM-TC 

74.8 

12749 

YDR258C Tm oredic 604TC 

aaaam t^ff am f^affm CaaaaffomCTtTmf^tTc 

MM-TC 

72 

12733 

YDR258C Tm nredic 631TC 

oamr^affmCcTcaotaaaTTppcmf^acmPa 

MM-TC 

77 4 

12734 

YDR258C Tm nredic 686TC 

mt^mf^ onTTa a a mf^ oaf* n oji m f^p oTflTomf^t 
iiiv.'iiix^ga 1 1 aaaiii^gavagaiiix^o^ 1 oi giiiv./i 

MM -TP 

IVl 1 VI I 

70.2 

12735 

YDR258C Tm nredic 757Tr 

aTm a ?t Xi^ o D a a n tm n om r^Xn a a crm f^f* 

a I lllV^uau 1 clg^ua.allll\^aglll^ 1 CL£ia^lll\^\^ 

MM-TC 

64 6 

12736 

YDR258C Tm nredic 8nTC 

i l^IV^ «J 1 111 IJl VVllV KJ I -J I 

oa mf^m f^'Xaa oa a mf^a an a a omr^Xocrmr^ n at 

^ulliv^lllV^ 1 aa^oalllv^aaaCiii^i J 1 i ^glliv.^<lul 

MM-TC 

71.4 

12745 

YDR258C Tm nredic 823TC 

1 JLXl Vitr^CJ V«/ 1 111 LflV'Vll\^ 1 \^ 

aXaaaomf^XcTomr^atta ooXmr^XcXXXt 

u 1 dcici^iiiv i g^iiiv-^diia^^ J iiiV" XVI 1 11 

MM-TC 

74 

12743 

YDR258C Tm nredic 870TC 

aoamf^oXamf^'aomf^aacaoaaaXa omfao 

ugaiiiv^^ 1 €iiiiv^(i^iiiv_^aavu^aaci i ugiiiv^a^ 

MM-TC 

73.6 

12737 

YDR258C Tm nredic 888TC 

aaaXa tymr^acrmr^aaaocrmOcTmr^oXctXo 

aaa i a^iiiV^<xgiiiV/(icl<i^giiix>/V i lll^g J Ci i ^ 

MM-TC 

79 3 

12738 

YDR258C Tm nredic 890TC 

aXa (Tmf^a oc a a TocccXp otmf^tXo Of 

MM-TC 

715 

12739 

YDR258C Tm nredic 896TC 

ocaatoropctmf^oannf^ttooccaacora 

MM-TC 

71 1 
/ J. 1 

12768 

YnR258r Tm nredic 04 PM 

1 L/J\X>'Ov^ 1 III UlvUIW r iVl 

tttoota OP a p oapa a optta otat 
III ggiagio ddf ^a\f€Ui^\> I ut^ia i 

PM-DNA 


15771 

YT)R258r Tm nredic 078 PM 
1 ixai J o 1 111 |jicuiw_v/ / o_ r IVl 

pnpfptaQpa otttpoppotttpta 
Lavii/ iaowagiiiC3V/Wgiiiv»ia 

PM-DNA 


1 ^ / yjKj 

YDR^SSr Tm nrpdir 174 PM 

ottoppatooacrttpaaaatptotp 
gll^WalggagllLaaaalC l^lV> 

1 iVl-l_/l>(/A 

f>S 8 

12772 

I l^IX^^OV^ 1111 Ul CU1L< 1 1 iVl 

tpaatoCTPPttQpappatata atto 
aa Ig^^ 1 LgC aUv a la Ida I 

PM-DNA 



YDR^SSr Tm nredir 701 PM 

1 JL/lvX JOv-- I 111 Ul CVJI^ 1 I IVl 

tpa crttaoppa atppttpopttpat 
IVaglla^WaalW Lie gv> Ileal 

PM-DNA 

UU. 1 

12770 

YDR^SSr Tm nrpHir 940 PM 

f tttt p o OP p a a a f* o a t p tt o n a tt 
llllieg^veaaaegalL'llgaall 

PM-DNA 

f^S 

1 5774 

YDR^SXr Tm nrpdir 90S PM 
I i^rvx^ov-' 1 til picuiL> z.yj fivj 

a tt or a pp tp a a a a ptttp tt CT era ta 
al igae V lwaaaaL< I lie llggala 

PM-DtsIA 

U 1 .o 

12764 

YDR9S8r Tm nredic "^Sfi PM 
1 LJi\^jo\^ 1 ill uivuic JJ\j r IVl 

oatoaapfpaootooataooatptt 

PM-DNA 

F IVl-L^l^/A 

64 

12752 

YDR258r Tm nredic 424 PM 
I ia/r\^a/ov^ 1 III |,Jicui^ *T^*T r ivi 

a pp a tp a tpa p pp a a pttt <rt otp o 
aeea le a leae eeaae m^igieg 

PM-DNA 
rivi-i^i^/^ 

67 S 

12775 

YDRlSSr Tm nredic 4"^! PM 

I iJl\^mJO\j 1 III UlCUIk^ *TJJ 1 IVl 

ppp a a ptttototp ottta a ta a a a 
L ee oue 1 1 ig i^ IV g I L ucut utaoa 

PM-DNA 
r ivi-L/i>i/\ 

61 

12753 

YDR258C Tm nredic 486 PM 

1 ±^m\,^i^J\j\^ 1 III IJIWUIV ^yJ\J 1 IVI 

c a a t £r a tc Pt Dtta c 0 oaa a tea ac 

eaai^aiv^igilcie^^aclaieclaw 

PM-DNA 

64.2 

12754 

YnR9S8r Tm nrpHir SIS PM 
I ur\.Auo\^ 1 111 |jicuiv< I J F ivi 

tooppta oooa atpocrtpa opttap 

PM-DNA 

70 1 

12755 

YDR2S8r Tm nredic S66 PM 

I Lyi\X-JtJ\^ 1 111 L/i \^UiV ^\J\J J IVl 

ttofjaaapatpooocrtopopttttt 

PM-DNA 

1 iVl L/lT|/A 

70.8 

12756 

YDR9S8r Tm nredic S60 PM 
1 ui\.^JO\^ I 111 |JI wuic mjyjy r ivi 

oaaapatpooootopoptttttpaa 

PM-DNA 

68 7 

12773 

YDR258C Tm nredic 604 PM 

aaaapoapacxpataaooptttptfp 

aaaov^a V a^ va iaaggw 1 1 i iv 

PM-DNA 

64.5 

12757 

YDR258C Tm nredic 631 PM 

A .L^JX^«/V\^ J 111 L/i V%liv 1 M iVl 

oaca opctca otta attooppappa 

guv w i-v OK*- laa i> i Kg w ctv w a 

PM-DNA 

1 IVl L^l ^ / \ 

70.2 

12758 

YDR258C Tm oredic 686 PM 

1 l^IV^^(J\^ J III L/lvUiw \J%J\J 1 iVl 

cc pattaaac a aff aca otat ffct 

PM-DNA 

J ITl l»^l^*^ 

65.4 

12759 

YDR258C Tm oredic 757 PM 

J A^lV^«i/U^^ J ill L/l wVliw t ^ i 1 iVl 

atcaaataffgaattca^ctaaapcc 

PM-DNA 

61.5 

12760 

YDR258C_Tm predicts 1 3_PM 

sacctaa&aacataaaecte&caat 

PM-DNA 

63.5 

12769 

YDR258C_Tm predic_823_PM 

ataaagctggcaataggtctctttt 

PM-DNA 

63.6 

12767 

YDR258C^Tm predic J70^PM 

agacgtacagcatcagaaatagcag 

PM-DNA 

66.6 

12761 

YDR258C_Tm predic_888_PM 

aaatagcagcaatggcctcgtcttg 

PM-DNA 

69.2 

12762 

YDR258C_Tm predic_890_PM 

atagcagcaatggcctcgtcttggc 

PM-DNA 

72.7 
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Probs 
nuniucr 

wiiyoi luuicuiluc laiyci riarnc 

v^ornpierneniary laiyei bcCfUcncs 

^1 AO inn 

Average 

moltinn 

1 97/^'^ 
J Z /Oj 

YnR7S8P Tm r\r(^(\\r 80^ PM 
1 i-/r\.zjov^ 1 III picuic oyo r jvj 

gc aaiggc c ic gic uggcc aac ga 

PM ni\iA 

77 R 
/Z.o 

J z /yz 

VTiP7^8P Tm rkroHir* fldlT P\/l 
I UlvZ Jo\-/__ 1 in preaiC__trf J 1 _JrW\ 

ril ggTagcacgacaagcTTagTat 

PM T 
rivi- 1 

7/1 7 

IZ / 7J 

I uivz joi^_ 1 in preuic__u / 0 1 rivi 

L«aC I CI oaCagi I iCgCCg I 1 I C 1 a 

PM T 

7S ^ 

Iz / VU 

Vr^D7^fiP Tm v%«>A/1i/* 1 74T P\4 

T UKZ joi-'_ 1 m preQic_ i m i _r m 

g J TgccaTggagTtcaaaaTcTgTc 

PM T 

lA 7 

1 Z /7O 

VTiPO^fiP Tm rir<»/1i/» 1 A4T PM 

I UKZ jov-'_ 1 m prcaic_ i i ivi 

rcaaTggccTTgcaccaTaTaaTTg 

PM T 
r IVl- 1 

lA 

1 77CO 

1 z /©y 

Vr^P7^QP Tm rkfaAtf^ 70 IT P\il 

1 L/K.zDoL'_ 1 m preaic_zu 1 1 _r ivi 

1 cag 1 J agccaa i cci i cgc 1 1 cat 

PM T 

tJ. / 

1 Z / 7*t 

VTiP7^fiP Tm nr<>Hi/» 740T PM 

T LilvZ jo^_ 1 m prcuic_Z'iy i ivi 

rtTTTcggccaaacgaTcTTgaaTt 

PM T 
rlVi- 1 

71 
/ 1 

1 770Q 
1 Z /Vo 

VF>P7^QP Tm TM-oHir> 70 ^T PK/I 

Y UKZjoL^_i m preaic_zyj i rivi 

aTTgaccTcaaaacTTTcTTggaTa 

PK/I T 
r IVl- 1 

lf\ 7 

1 Z /oo 

YUKzjoi-^ I m preaic jjoj rM 

gaTgaac Tc aggTggaT aggaTc Tt 

D\A T 
rlVl- 1 

7 1 
/ 1 

1 Oil fx 

I uivZ3ov^_ 1 m preaic_HZ'+ 1 _rivi 

accaTcaTcacccaac Fl TgTgTcg 

PK/I T 
rlVI- 1 

70 7 

/y.z 

1 7700 

1 z /yy 

I L/IvZjoL^^ 1 ni picGlC ^JJ 1 r M 

ccc aac TTTgTgTc gTTTaaTaaaa 

PM T 
riVl- J 

7^ 1 
/ J. 1 

17777 

iL/tvzJov^ 1 rn prcQic *+ooi iivi 

caaTgaTcgTgtTacggaaaTcaac 

PM T 

77 7 
/Z.Z 

1 777Q 
IZ / /o 

VT>P7^fiP Tm rkroHio ^ 1 ^T PM 

I LIKz jol^_ 1 Tn prealC_j I j l _r IVl 

TggccTagggaaTcggTcagcTTac 

PM T 

7A O 

/o.y 

1 777Q 

IZ / /y 

Y UKz joC_i m preoic_joo i _r jvi 

TTggaaacaTcggggTgcgcTTTTt 

D\>i T 

rM-l 

7A 7 

1 z /oU 

Y UKz joL._ J m preciic jov I rjvi 

gaaacaTcggggTgcgcTTTTTcaa 

r>\A T 
rM- I 

/o 


YuivZDoL/ 1 m preaic ou*fi rivi 

aaaacgacagcaTaaggcTTTcTTc 

rM- 1 

71 7 
/ I . / 

1 0 70 1 

Iz /ol 

YUKz joC_i m preaic_DJ i i_r JVi 

gacagccTcagtTaaTTggccacca 

T>\A T 

rM- 1 

7C /I 

1 7707 

Iz /oz 

YUKZJoL-_l m preGlC_OOD l_rlvl 

ccgaTTaaacgagagacagTaTgct 

JrM-l 

77 7 
/Z.Z 

1 7701 
1 Z /oJ 

VT>T3 7^0^^ Tm ni-oHi/^ 7^7T \>\A 

Y jjJvzDoC_ 1 m preaic_ /d / 1 _r ivi 

aTcaaaTaggaaTTcagcTaaagcc 

T>\A T 

rM- 1 

oo,j 

1 77Q/1 
Iz /o4 

VT\P7^fiP Tm wisHi^ 01 IT P\>l 

gaccTaagaacaTaaagc Tggc aat 

PK>I T 
rM- 1 

A7 ^ 

1 7707 

IZ /yj 

VT\U7^fiP Tm v\raA\f* 07 IT 1>\A 

Y UKz jo*^_ 1 m preaic_oz J i _r ivi 

aTaaagcTggcaaTaggTcTcTTTt 

DK>I T 
rM- 1 

7>1 7 

1 7701 

Iz /y 1 

Y jJKz joC__ I m preoic_o / u J _r jvi 

agacgTacagcaTcagaaaTagc ag 

Dli>l T 

rM-l 

7 1 >1 
/ 1 .H 

1 770^ 

Y LIKzDoC_ 1 m preciic_ooo l _r IVi 

aaaTagcagcaaTggccTcgTcTTg 

D\A T 

rM-l 

77 7 
/ /.Z 

1 778A 
IZ /oO 

Y LiivzDov'_ 1 m prcaic_oyu i _r jvi 

aTagcagcaaTggccTcgTcTTggc 

PM T 
rM- 1 

on 1 
oU. I 

1 7707 
1 Z /o / 

VnP7^0r' Tm «r<iH^^ OQAT DM 

Y L/KzDoL'_ i m preoic_oyo I _r IVI 

gcaaTggccTcgTcTTggccaacga 

1>\A T 
rM- 1 

on 0 
oU.o 

1 70 1 A 
1 Zo 1 0 

VT^P7^QP Tm nr-f>A\r' fXAITH P\4 

Y L/KZ joi^ 1 m preaic lh j i r ivi 

TtTggTagmC am Cgam C aagm CtT agTat 

D\A TC 
rM- 1 

70 

1701 0 

i z<5 J y 

vr\i?7^op Trv» t-»»-c»riir. n70Tr' \y\A 
YUKzjov^ 1 m preaic w/oiL^ riVi 

mCamCtmCtaamCagtTtcgmCcgTtTmCTa 

D\A Tr" 

rM- 1 U 

0 1 Q 

o i .y 

1 701/1 
1 Zo J*t 

Vr>P7^0P Tm r»i-<aHi*^ 17ilTP P\/l 

Y uivZjov^_ 1 m preaic_ J i i^_r jvi 

gTt gm C m C aTggagTtmC aaaaTm CtgTc 

PM TP 
r M - 1 

70 1 

/o.J 

J ZoZU 

Y i-/jvz jov^_ I m prcaic_ J Oh i v_,_r ivi 

i iTiL^aa 1 ggm^^c i i gmi^acnii^a i a i aa i i g 

PM TP 
rM- 1 

oj.D 

1 7R1 
1 Zol J 

VnP7^RP Tm nrpHio 701 TP PM 

I \jw\^jo\u 1 m preaic zui iv^ rivi 

I uiL/ag 1 lagcini^aa i cini^i j iii^gini-/nniL^ai 

PM TP 
r IVI- 1 ^ 

fil 

O 1 .o 

1 7R1 8 
1 Zo 1 0 

Y uivzjo\-/__i m preaic__z*fy i i-^_rivi 

mil niL^ggnii^niL/aaaiiii^gainii^i i gaa 1 1 

PM TP 
r M- 1 

77 ft 

12822 

YDR?SJ^r Tm nrerlir 7QSTr PM 

aTTofimi ml Tm I 3 ri a ami 1 1 tmP 1 1 oonTn 

a 1 lp,ulll 1 1 1 III 1 v^uuddl 1 1 1 1 1 1 1 I 1 t^^,u 1 a 

PM-TC 

75.6 

12812 

YDR258C_Tm predic_356TC_PM 

gaTgaamCTmCaggTggaTaggaTmCTt 

PM-TC 

77.3 

12800 

YDR258C_Tm predic_424TC_PM 

amCcaTcaTcacmCmCaaccTtgTgtmCg 

PM-TC 

74.2 

12823 

YDR258C_Tm predic_433TC_PM 

mCmCmCaacTTtgTgTcgTTTaaTaaaa 

PM-TC 

74.8 

12729 

YDR258C_Tm_predic_486TC_PM 

mCaaTgaTmCgTgttamCggaaaTmCaac 

PM-TC 

75.5 
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1 1 KJUKS 

nijmhpr 

Olinnni jrlpntiHp tarnpt namp 

nnmnlpmpntarv tarnpt <^pniipnrp 

np^inn 

r^vci aye 

duolex 
melting 
temo (°C) 

12802 

YDR258C Tmpredic 515TC PM 

To prnPcTappeaaTmCppTcaCTnCtTac 

PM-TC 

83.6 

12803 

YDR2^8r Tm nredic S66Tr PM 

1 L/s\^JO^ 1 111 pi wuiv J\j\j I r ivj 

TTooaaamPatmPooooTomPoftT'tTt 

J 1 g^aaalllv^allliv^gggg I ^lilv^g^l l 1 1 l 

PM-TC 

80.8 

12804 

YDR258C Tm oredic 569TC PM 

paaamCaTmCppopTpmCffctttThnPflJi 

PM-TC 

83.2 

12821 

YDR2S8r Tm nredic fi04Tr PM 

a fl n a m P oa mPn otTi Pa Xn Ji o OTnPXtXTnf^tXf 

CLCLVUX IIIV^^cllll wq^l 1 1 <1 1 (l(lgglll\^ 1 I J lll\^l 1 W 

PM-TP 

7Q 

12805 

YDR258C Tm nredic 631TC PM 

oa mPa omPpXca otXa aXXocpmPamiPa 

^fXllt\^<XKAHK^\f 1 VCIK> ' ACk 1 1 K&VIIIV^ClVIIIV^Cl 

PM-TC 

82.5 

12806 

YDR258C Tm nredic 686TC PM 

I rJm\^JO\^ 1 111 ^IvUlV vOU 1 J IVI 

mPmPoaXXa aamP oa oa oanriPa oTaXomPt 
iii\^iii^^a 1 i <iciaiii^go^agaiii\^ag i o J ^iii\^i 

PM-XP 

79 4 

12807 

YDR258C Tm nredic 757TC PM 

aXmPaa^TapcraaTtmPacTmPTaaaomPr 

PM-TC 

71.4 

12808 

YDR2S8r Tm nrpHir J^HTP PM 
1 i_v ivz. J o 1111 uj c-uic o 1 J 1 r ivi 

onmPmPXn Jioii amPjiXji^n CTmPXoomPant 
^alllV^lllV^ 1 au^adlll^a i aao^illw i ^^lllV^ydul 

PM-TC 

78 0 

12817 

YDR258C Tmpredic 823TC PM 

aTaaHgmCTggmCaaTaggTmCTcT lit 

PM-TC 

81.2 

12815 

YDR258C_Tm predic_870TC_PM 

agamCgTamCagmCaTmCagaaaTagmCag 

PM-TC 

81.9 

12809 

YDR258C^Tm predic_888TC_PM 

aaaTagmCagmCaaTggmCcTmCgTctTg 

PM-TC 

86.8 

12810 

YDR258C_Tm predic_890TC_PM 

aTagmCagcaaTggmCcTcgtmCtTggc 

PM-TC 

83 

12811 

YDR258C_Tm predic_896TC_PM 

gcaatggcctmCgTmCttggccaacga 

PM-TC 

79.3 


Example 52. Performance analysis of LNA substituted oligonucleotide capture probes 
designed to detect splice variants in complex RNA pools. 

A. Oligonucleotide design for microarrays. The methods for designing exon-specific internal 
oligonucleotide capture probes has been described in example 2. 
Al. Design of the LNA-modified capture probes 

For the internal LNA-modified oligonucleotide capture probes, every third DNA 
nucleotide was substituted with an LNA nucleotide. The probes designed to capture the splice 
junction of the recombinant splice variants were designed with LNA substitutions at every third 
nucleotide position. All capture probes are shown in Table 15. 

Table 15. Internal, exon-specific and merged, exon-exon splice junction specific oligonucleotide 
capture probes used in the example. Capital letters denote LNA nucleotides and "^C LNA methyl- 
1 5 cytosine (SEQ ID NOs: ) 


EQ No 

Oligo Name 

Sequence 

10716 

>gene78.01a 

cctgaaagtagatttgttatttccgaaacgccttctcccgttcttaagtc 

10717 

>gene78.01b 

catataccacaaatagtGcctcaaaaatcacaaqaaaactcacaacactq 
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CO Mrk 

^lino Klama 
VilllyO INalTIc 



^>n£inQ7Q 0*50 

<-*gene/ cs.uoa 

ganigcagcggiggiaadddgidiyddadoyiyy laaiiaaaay 


^gene/ o.uoD 

CCaaigadadCladlCdddgyidddUy lyydiouudiyyLrdduuuoyyy 


>yene / o.rnu i u*5 

ca(^acacigcccagaggu(^aicgdiaaaiaigigadggdddig(A« 


^gene# o.mul iinoo 

c^acacigccc^gaggucaaicgaicc^aiyaicui 


>genG/o.nriir>JooUo 

giccagiaicgiccaicaiagiaicgataaaiaigiga 


>genef o.iiNoo 

cicciiciigcaucuCdacuccucaacacugagcggo 


>gene^ O.rnu 1 1 In o4 

caacacigcccagaggucaaicgaigiyigaiaygaicagiyucaygg 


>genef o.miiNo4uo 

yaaggcgaaggagacigciaaiaicgaiaaaiaiyigaaygaaai^ 


>gene/ o.iiNo4a 

yaacgiaigagcaigcgagagacgcigiaguggaoaaacccdLAjd^ 


>genef o.iNo4D 

gaaaccgcigaiiaiacigcggagaaggigggigagiaiadagaciaiac 

A A OA 

>penef <5.u la 4U 

aagiaganignauiccgaaacgccucicccgncn 

no4o 

>gene/o.ui d_4u 

accacaaatagtccctcaaaaatcacaagaaaactcacaa 


>gene/o.uoa_4u 

gcagcggiggiaaaaagiaigaaaacgiggiaanaaaag 

1 1^54o 

>genefo.uoD 4U 

gaaaactaatcaaaggtaaacgtggatcccatggcaattc 

4 4 Q>|0 

>gGnG f o.min uo_4u 

/^<*i"»4#^/^r^/^'fij^'^/^#^44^«fi<54/^^ oi4*^ o '^4*^4/^4^ o o /*i o *34 

cacigcccagaggiicaaicgaiaaaiaigigaaggaaai 

4 4 ocn 

>gene / o.mu 1 1 iNoo_4u 

#^4^ r^^r^'^#^^^M44#^^^4#%^ ^4^^^^4^^4^^4^^4^'3 

cig ccca gaggncaaicg aiccg aig a iccia aiy aagg 


^/-ii=kf-»«7Q miMCono /in 
>gene/ o.miNooUo_4U 

gtatcgtccatcatagtatcgataaatatgtgaaggaaat 

4 4 OCO 

^<-ii^rk«7Q IMCO ACX 

4x>44^^ ^44 ^44rx^ ^^44^^44/^ ^ /^44 /^/^ ^ *^ 4^ 

iciig caiiciica a cnccuca a ca cng ag eg g a g ic 

4 <4 OCO 

>vi^/^i-iA7Q mAi IKIC>I Af\ 

>genef o.mu 1 hno4_4U 

ctgcccagaggttcaatcgatgtgtgataggatcagtgtt 

4 4 OCA 

>genef o.miiNio4Uo_4U 

cgaaggagactgctaatatcgataaatatgtgaaggaaat 

4 4 OCC 

>genef o.iiNo4a_4u 

tatgagcatgcgagagacgctgtagttggaaaaacccacg 

4 4 OCC 

1 loob 

>genef o.iNb4D_4U 

cgctgattatactgcggagaaggtgggtgagtataaagac 

4 4 OC7 

11357 

'>.nAnj-k70 n4»-» Cr» 1 MAO 

>genef o.uiaoULNAo 

«.,,x — _ A —,4 A — ^Tt+Z^ 44 A ♦4T/^/-»/^ »^ A ^r^rr»^/^4T/^4rYi/^r»o^f4rr»r^44 A o/^T/"- 

mucioaaAgiAga i uouAu \ ccoaaAcgmuci i cimucconmuiiAag i c 

4 4 OCO 

1 looo 

>gene / o.oi d_oO_ln Ao 

mLratAiamL/CamLraaAtaoicmucimoaaAaa i camuaaoaaAac i camoaamv^ac i g 

4 4 OCrt 

1 1 009 

*^/-.^rt^70 AO« CA 1 KIAO 

>gene/o.Uoa du linAo 

oat 1 igmL/agmugg i gg i aaAaaoia i gaAaamugiogiAai i aaAagoic \ c 

4 4 OCA 

1 1 JbU 

•*^nnnn7Q AOK CA 1 KIAO 

>gene/o.UoD_ju_LiMAo 

mucaAigAaaActAaimL'aaAgg i aaAcg i ggAicmL/ca i ggmuaa i icmucgog 

4 4 004 

11361 

^MMm^70 »^A4AO CA 1 KIAO 

>gene7o.muiUo ou lnao 

inCacAacActGccmCagAggTtcAatmCgaTaaAtaTgtGaaGgaAatGccTg 

4 4 OCO 

11362 

— -.70 mA'llKICO CA 1 KIAO 

>genef O.mu 1 inoo__ou_lnao 

moaamoac i gcmucaoagoumL>aa i cgAicmuga i ga i cc i aa i gaAggmugcmuc 

4 4 OCO 

1 13do 

-,-70 -v\iMCOAO CA 1 MAO 

>gene/o.miNooU3_oU_LNAo 

oicmL/ag i aimugtmuca i ca i ag i aimuga i aaAia i gioaaogaAaiocc i g 

4 4 OC /I 

1 1od4 

-s/-ii-i»-»rt7Q IMCO CA 1 MAO 

>gene/o.iNoo_ou_LiNAo 

m04^r^^4im^44/^<^»^T4^T4^ A r-,^T"4/^rr»044ror^ o orr^r^ o ^Tf/^ A r^n-f^r^TiC^^ f^(~l.rt\r^r^ 

mL/icmunmLrUoca i ic i iCAac i icmuumL/aamuac i igAgcogaoxcogxoc 

1 lobo 

/-I /-s r-1 /-» 7 D mA'IIKIC'yl CIA 1 KIAO 

>gene/ o.mul ino4_ou_lnao 

mL/aamL/ac i gcmucaoagoiimuaa i cgAig i gioaiAggAicAgioiimL^agog 

4 4 OCC 

11366 

^nnnn7Q mlKIOjiAO CA 1 KIAO 

>gene / o. m hno4Uo_ou_lin ao 

oaaogcoaaogaoac i gc i aa i aimoga i aaAia i gioaaogaAaiocc i g 

4 4 OC7 

1 1367 

>g8ne/ o.iiNo4a ou linAo 

oaamLrgiAigAgcAigmugaoagAcgmL/ig i ag i igoaaAaamuccAcgAagmug 

4 4 OCO 

113bo 

-^r^r\r~\t-\~7 Q IKIC/tK t^A 1 KIAO 

>genG / o. 1 no4d_ou_lin ao 

^ A r^r^f^ /-»4/^ »^+"T"^* A r>k(^ r^r^(^ A o<-i^4n/^/^4/^onT?34 A <io/^0/*To4 Ar« 

oaaAccoCioai i aiACiocgoagAagoigogioag i aiAaaoac i aiAc 

1 loby 

'^<^Ano7Q A1 «!» ylA 1 KIAO 

>gGnG/ o.ui a_4U_Ll>iAo 

aAgiAga i uoHAu i ccoaaAcgmuci i cimv-rCCoumL^u 

A A 07A 

^n/?ir»<=k7Q Ai K AH 1 KIAO 

>gGnG A o.ui D_4U_LinAo 

amocamL»aaAiaoicrnucimL/aaAaa i camLraaoaaAac i camL/aa 

4 4 074 

'!*ncknQ7ft AOo AH 1 KIAO 

>gGnG f o.Uoa_4U_L!NAo 

gmoagmugg i gg i aaAaaoia i gaAaarnugiogi/\ai i aaMag 

4 4 070 

•^i-i«rii=i7Q AOK AC\ 1 KIAO 

>gGnGf O.UoD_4U_LlNAo 

gAaaACiAaimLfaaAgg i aaAcg j ggAicmuca i ggmoaa i ic 

4 4 070 

*^/-iAnA7Q mAiAl >IA 1 KIAQ 

>gGne / o.mu i uo_4u_ln ao 

CACioCcmuagAgg i icAaimuga i aaAia i gioaaogaAai 

4 4 OTil 

1 1o/4 

-^t^r\nf^7Q mAilKICO Af\ 1 KIAO 

>gGnG/o.mui iinoo_4U_linAo 

c 1 gcmocaoagoiimoaa i cgAicmuga i ga i cc i aa i gaAgg 

1 1 0 1 \J 

'^yCl IC r O.I 1 IIINOOWO *t\J LINrAO 

nTsitmnntmPraTrflTanTfltmnnaTaaAtaTntCnflflCinaAat 

y 1 oil 1 iv>y 11 1 iv_/vyCi i v^o i oy i aiii iwyo i cicir^ici i ^iwoa v^v^a/^ai 

11376 

>gGnG78.INS3„40„LNA3 

tmCttGcaTtcTtcAacTtcmCttmCaamCacTtgAgcGgaGtc 

11377 

>gene78.m01 INS4_40_LNA3 

cTgcmCcaGagGttmCaaTcgAtgTQtGatAggAtcAgtGtt 

11378 

>gGnG78.mlNS403_40_LNA3 

cGaaGgaGacTgcTaaTatmCgaTaaAtaTgtGaaGgaAat 

11379 

>gGnG78.INS4a 40 LNA3 

tAtgAgcAtgmCgaGagAcqmCtgTagTtgGaaAaamCccAcg 

11380 

>gGnG78. 1 NS4b_40_LN A3 

cGctGatTatActGcgGagAagGtgGgtGagTatAaaGac 
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B. Printing and coupling of the splice isoform-specific microarrays 

The splice variant capture probes were synthesized with a 5' anthraquinone (AQ)-modification, 
followed by a hexaethyleneglycol-2 (HEG2) linker. The capture probes were first diluted to a 20 

5 |iM final concentration in 1 00 mM Na-phosphate buffer pH 7.0, and spotted on the Immobilizer 
polymer microarray slides (Exiqon, Denmark) using the Biochip Arrayer One (Packard Biochip 
Technologies, USA) with a spot volume of 2x 300 pi and 300 ^im between the spots. The capture 
probes were immobilized onto the microarray slide by UV irradiation in a Stratalinker with 2300 
^joules (Stratagene, USA). Non-immobilized capture probe oligonucleotides were removed fi*om 

10 the slides by washing the slides two times 15 min. in IxSSC. After washing, the slides were 
dried by centrifugation at lOOOx g for 2 min., and stored in a slide box until microarray 
hybridization. 

C. Construction of the Splice Variant Clones 

The recombinant splice variant constructs were cloned into the TriamplS vector 
1 5 (Ambion, USA). The constructs were sequenced to confirm their construction. The plasmid 
clones were transformed into E, coli XLIO-Gold (Stratagene, USA). 

Genomic DNA was prepared from a wild type standard laboratory strain of 
Saccharomyces cerevisiae using the Nucleon Mi Y DNA extraction kit (Amersham Biosciences, 
USA) according to the supplier's instructions. Amplification of the partial yeast gene was done 
20 by standard PCR using yeast genomic DNA as template. In the first step of amplification, a 

forward primer containing a restriction enzyme site and a reverse primer containing a universal 
linker sequence were used. In this step 20 bp was added to the 3 '-end of the amplicon, next to the 
stop codon. In the second step of amplification, the reverse primer was exchanged with a nested 
primer containing a poly-T2o tail and a restriction enzyme site. The SWI5 amplicon contains 730 
25 bp of the SWI5 ORF plus 20 bp universal linker sequence and a poly-A2o tail. 
The PCR primers used were; 

YDR146C-For-EcoRI: acgtgaattcaaatacagacaatgaaggagatga (SEQ ID NO: ) 
YDR146C-Rev-Uni: gatccccgggaattgccatgttacctttgattagttttcattggc (SEQ ID NO: ) 
Uni-polyT-BamHI: acgtggatccttttttttttttttttttttgatccccgggaattgccatg (SEQ ID NO: ), 
30 The PCR amplicon was cut with the restriction enzymes, EcoKl + BamWl. The DNA 

fi-agment was ligated into the pTRIamplS vector (Ambion, USA) using the Quick Ligation Kit 
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(New England Biolabs, USA) according to the supplier's instructions and transformed into E, 
coli DH-5a by standard methods, 

CI. Construction of the recombinant splice variant #1 (Triampl8/swi5-rubisco) 
The Arabidopsis thaliana Rubisco small subunit ssu2b gene fragment (gi 17064721) was 
5 amplified from genomic DNA by primers named DJ305 5'- 

ACTATGATGGACGATACTGGAC-3' (SEQ ID NO: ) and DJ306 5'- 
ATTGGATCGATCCGATGATCCTAATGAAGGC-3' (SEQ ID NO: ), containing CM 
restriction site linkers. The purified PCR fragment was digested with Clal and then cloned into 
the swi5 (gl:7839148) vector at the unique Clal site (atcgat) giving each insert a flanking 
10 sequence from the original yeast SWI5 insert (named exonOl and exon 03, see Figure 19). The 
product was inserted in the reverse orientation, so that the insert sequence is: 

atcgatCCGATGATCCTAATGAAGGCGCCCGGGTACTCCTTCTTGCATTCTTCAACTTCC 
TTCAACACTTGAGCGGAGTCGGTGCATCCGAACAATGGAAGCTTCCACATTGTCCAG 
1 5 TATCGTCCATCATAGTatcgat (SEQ ID NO: ) 

Nucleotide sequence analysis revealed a difference between the sequence of^. thaliana Rubisco 
expected from the GenBank database and that obtained from all sequenced constructs and PCR 
products. Position 30 in the Rubisco insert is C rather than the expected A. This SNP was 
20 probably created by PCR. None of the oligonucleotide capture probes used in the example cover 
this region. 

Rubisco seq. in genbank TCCTAATGAAGGCGCCA (SEQ ID NO: ) 
The sequence obtained 

from the plasmid contruct TCCTAATGAAGGCGCCC (SEQ ID NO: ) 
25 C2. Construction of the recombinant splice variant # 2 (Triampl8/swi5-lea) 

The Arabidopsis thaliana Lea gene (gi 1526423) was amplified from genomic DNA with primers 
named DJ307 5'-GGAATTATCGATGTGTGATAGGATCAGTGTTCAG-3' (SEQ ID NO: ), 
and DJ308 5^AATTGGATCGATATTAGCAGTCTCCTTCGCC-3' (SEQ ID NO: ), including 
the Clal linker sites as above. The PCR fragment was digested with Clal cloned into the yeast 
30 SWI5 IVT construct as above at the unique Clal site. 

The fragment was inserted in the forward orientation, resulting in the following insert sequence: 
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atcgatGTGTGATAGGTTCAGTGTTCAGGGCTGTCCAAGGAACGTATGAGCATGCGAGA 
GACGCTGTAGTTGGAAAAACCCACGAAGCGGCTGAGTCTACCAAAGAAGGAGCTCA 
GATAGCTTCAGAGAAAGCGGTTGGAGCAAAGGACGCAACCGTCGAGAAAGCTAAG 
GAAACCGCTGATTATACTGCGGAGAAGGTGGGTGAGTATAAAGACTATACGGTTGA 
5 TAAAGCTAAAGAGGCTAAGGACACAACTGCAGAGAAGGCGAAGGAGACTGCTAATa 
tcgat. (SEQ ID NO: ) 

Figure 1 1 shows the construction of the recombinant splice variants in the in vitro 
transcription vector. The small bars show the location of the oligonucleotide capture probes used 
10 in this example. The sequences of the capture probes are shown in Table 15. 
D. Preparation of target 

Dl. In vitro RNA preparation from splice variant vectors 

In vitro RNA from the splice variants were made using the MEGAscript™ high yield 

transcription kit according to the manufacturer's instructions (Ambion, USA). The yield of IVT 
1 5 RNA was quantified at a Nanodrop spectrophotometer (Nanodrop Technologies, USA) 

D2. Isolation of total RNA from C. elegans 

Strains and growth conditions: C elegans wild-type strain (Bristol-N2) was maintained 

on nematode growth medium (NG) plates seeded with Escherichia coli strain OP50 at 20 *^C, and 

the mixed stages of the nematode were prepared as described in Hope, I. A. (ed.) " C. elegans - 
20 A Practical Approach Oxford University Press 1999. The samples were immediately flash 

frozen in liquid N2 and stored at - 80 until RNA isolation. 

A 1 00 |Lil aliquot of packed C. elegans worms from a mixed stage population was 

homogenized using the FastPrep Biol 01 from Kem-En-Tec for 1 min, speed 6 followed by 

isolation of total RNA from the extracts using the FastPrep Biol 01 kit (Kem-En-Tec) according 
25 to the manufacturer's instructions. The eluted total RNA was ethanol precipitated for 24 hours at 

- 20''C by addition of 2.5 volumes of 96% EtOH and 0.1 volume of 3M Na-acetate, pH 5.2 

(Ambion, USA), followed by centrifiigation of the total RNA sample for 30 min at 13200 rpm. 

The total RNA pellet was air-dried and redissolved in 10 |Xl of diethylpyrocarbonate (DEPC)- 

treated water (Ambion, USA) and stored at - 80^C. 
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E. Fluorochrome-labelling of the target 

The following fluorochrome-labelled cDNA targets were synthesized to test the 
performance of 'merged' splice junction probes that encompass exon borders. Synthetic RNAs 
corresponding to three artificial splice variants; #1 (exon01-INS3-exon03 (1-INS3-3), #2 
5 (exon01-INS4-exon03) (01-INS3-3) and #3 without the middle exon (01-03) were spiked into 
lO^g of C elegans reference total RNA samples in various combinations and concentrations 
prior to fluorochrome-labelling with either Cy3 or Cy5 as indicated in Table 16. At the same 
time lO^g of C. elegans reference total RNA was labeled with Cy3 for control experiments. 
Hybridizations were performed with Cy3- and Cy5 labeled C elegans RNA + spike RNA mix . 

10 The details of RNA samples and synthetic RNA spikes are shown in Table 16. The RNA 

samples were combined in individual labeling reactions with 5 |ig anchored oligo(dT2o) primer 
and DEPC-treated water to a final volume of 8 |iil. The mixture was heated at 70^C for 10 min, 
quenched on ice for 5 min, followed by addition of 20 units of Superasin RNase inhibitor 
(Ambion, USA), 1 |iil dNTP solution (lOmM each dATP, dGTP, dTTP and 0,4 mM dCTP, and 3 

15 |xl of Cy3-dCTP or Cy5-dCTP, Amersham Biosciensces, USA), 4 |li1 5 x RTase buffer 

(Invitrogen), 2|Lil 0.1 mM DTT (Invitrogen), 400 units of Superscript II reverse transcriptase 
(Invitrogen, USA) and DEPC-treated water to 20 ^1 final volume. Background hybridization to 
merged capture probes was monitored with 10|ag of C. elegans reference RNA alone labeled 
with Cy3-dCTP; according to the labeling method described above for the splice variant spikes. 

20 All cDNA syntheses were carried out at 42^C for 2 hours, and the reaction was stopped by 
incubation at 70°C for 5 min., followed by incubation on ice for 5 min. 

Unincorporated dNTPs were removed by gel filtration using MicroSpin S-400 HR 
columns as described in the following: Pre-spin the column 1 min at 1500 x g in a 1.5 ml tube 
and place the column in a new 1 .5 ml tube. Slowly apply the cDNA sample to the top centre of 

25 the resin, spin 1 500-x g for 2 min and collect the eluate. The RNA was hydrolyzed by adding 3 
^1 of 0.5 M NaOH, mixing and incubating at 70 °C for 15 min. The samples were neutralized by 
adding 3 ^1 of 0.5 M HCl and mixing, followed by addition of 450 jil IxTE, pH 7.5 to the 
neutralized sample and transfer onto a Microcon-30 concentrator (prior to use, spin 500 |il IxTE 
through the column to remove residual glycerol). The samples were centrifuged at 14000-x g in a 

30 microcentrifuge for 12 min. Spinning was continued until volume was reduced to 5 ^1. The 
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labelled cDNA probes were eluted by inverting the Microcon-30 tube and spinning at 1000-x g 
for 3 min. 


Table 16. Synthetic splice variant RNAs spiked into C elegans samples*. 


Spike RNA 

Ratio 

Splice variant RNAs 

Observed ratio 

STDEV 

Observed ratio 

STDEV 

Expected ratio 

concentration 



LNA 50mer 

LNA SOmer 

LNA 40mer 

LNA 40mer 


lOOOppm 

5 

Cy3: spike 01-fNS3-03 

0.76 

0.05 

0.61 

0.19 

0.83 


1 

Cy3: spike 01-03 

0.24 

0.05 

0.39 

0.19 

0.17 


1 

Cy5: spike 01 -1NS3-03 

0,16 

0.04 

0.06 

0.03 

0.17 


5 

Cy5: spike 01-03 

0,84 

0.04 

0.94 

0.03 

0,83 









lOOOppm 

5 

Cy3: spike 01 -INS3-03 

0.77 

0.11 

0.68 

0.22 

0.83 


1 

Cy3: spike 01 -INS4-03 

0.23 

0.11 

0.32 

0.22 

0.17 


1 

Cy5: spike 01 -INS3-03 

0.12 

0.04 

0.11 

0.15 

0.17 


5 

Cy5: spike 01-rNS4-03 

0.88 

0.04 

0.89 

0.15 

0.83 









lOOOppm 

5 

Cy3: spike 01 -1NS3-03 

0.88 

0.08 

0.87 

0.10 

0.83 


1 

Cy3: spike 01 -INS4-03 

0.12 

0.08 

0.13 

0.10 

0.17 


1 

Cy5: spike 0MNS3-03 

0.22 

0.12 

0.15 

0,12 

0,17 


5 

Cy5: spike 01 -INS4-03 

0.78 

0.12 

0,85 

0.12 

0,83 









lOOppm 

5 

Cy3: spike 0MNS3-03 

0.89 

0.15 

0.11 

0.08 

0.83 


1 

Cy3: spike 0MNS4-03 

0.11 

0.15 

0.89 

0.08 

0.17 


1 

Cy5: spike OMNS3-03 

0,48 

0,2 

0,57 

0.31 

0.17 


5 

Cy5: spike 01 -INS4-03 

0,52 

0.2 

0.43 

0.31 

0.83 









lOppm 

5 

Cy3: spike 0MNS3-03 

0.61 

0.2 

0.09 

0.14 

0.83 


1 

Cy3: spike 0MNS4-03 

0.39 

0.2 

0.91 

0.14 

0.17 


1 

Cy5: spike 0MNS3-03 

0.34 

0.18 

0.19 

0.22 

0.17 


5 

Cy5: spike 01 -1NS4-03 

0.66 

0.18 

0.81 

0.22 

0.83 


* Parts per million (ppm) calculations indicate spike transcripts per total transcripts in the 
hybridisation mix. Calculations are based on an average C. elegans RNA being 1000 nucleotides 
as in Hill et al (2000). Science 290:809-812. 

10 F. Microarray hybridization 

The fluorochrome-labelled cDNA samples, respectively, were combined. The following 
was added: 3.75 |xl 20x SSC (3x SSC final, pass through 0.22 \i filter prior to use to remove 
particulates) yeast tRNA (1 |ig/^l final) 0.625 ^il 1 M HEPES, pH 7.0 (25 mM final, pass 
through 0.22 iifilter prior to use to remove particulates) 0.75 ^il 10 % SDS (0.3 % final) and 

15 DEPC-water to 25 ^il final volume. The labelled cDNA target samples were filtered in Millipore 
0.22 fi filter spin column (Ultrafi-ee-MC, Millipore, USA) according to the manufacturer's 
instructions, followed by incubation of the reaction mixture at 100 °C for 2-5 min. The cDNA 
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probes were cooled at room temp for 2-5 min by spimiing at max speed in a microcentrifuge. A 
LifterSlip (Erie Scientific Company, USA) was carefully placed on top of the microarray spotted 
on Immobilizer™ MicroArray Slide and the hybridization mixture was applied to the array from 
the side. An aliquot of 30 |iL of 3xSSC was added to both ends of the hybridization chamber, 
5 and the Immobilizer™ MicroArray Slide was placed in the hybridization chamber (DieTech, 

USA). The chamber was sealed watertight and incubated at 65°C for 16-18 hours submerged in a 
water bath. After hybridisation, the slide was removed carefiilly from the hybridization chamber 
and washed using the following protocol. The slides were washed sequentially by plunging 
gently in 2 x SSC/0.1% SDS at room temperature until the cover slip falls of into the washing 

10 solution, then in Ix SSC pH 7.0 (150 mM NaCl, 15 mM Sodium Citrate) at room temperature for 
1 min, then in 0.2 x SSC, pH 7.0 (30 mM NaCl, 3 mM Sodium Citrate) at room temperature for 
1 min, and finally in 0.05 x SSC (7.5 mM NaCl, 0.75 mM Sodium Citrate) for 5 sec, followed by 
drying of the slides by spinning at 1 000 x g for 2 min. The slides were stored in a slide box in 
the dark until scanning. 

1 5 G. Microarray data analysis. 

The splice variant microarray was scanned in a ScanArray 4000XL confocal laser 
scanner (Packard Instruments, USA). The hybridisation data were analysed using the GenePix 
Pro 4.01 microarray analysis software (Axon, USA). The C. elegans reference RNA alone 
converted to first strand cDNA and labelled with Cy3-dCTP did not produce significant 

20 fluorescence intensity signals from the LNA substituted spike RNA specific capture probes. 

Gl. A mathematical formula for analysis of the microarray data for alternative splicing 

One of the major limitations to comparative microarray hybridisation assays is that only 
identical probes can be compared between samples. Different alternative splice forms are 
detected using different probes, and this will tell directly if one splice form is more abundant in a 
25 given tissue compared to another. However, the estimation of the ratios between splice forms in 
a single tissue is not directly accessible. Given an example similar to that described below we 
employ the following calculations to calculate quantities of splice variants from array data. The 
theoretical justification is shown. To our knowledge this justification has not been used by any 
previous analysis. 
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The above scenario is tested in a comparative hybridisation, with two channels: I & II 
(signal from probe2 in channel I is called probe2(I), and so forth). Probe 1 hybridises to both 
splice forms, Probe2 hybridises to A only, Probe3 hybridises to B only. 
Since every transcript will hybridise to probel, and every transcript will hybridise to either 
5 probe2 or probe3, there exists some relationship between the following: 
probe 1(1) - {probe2(I) andprobe3(I)}. 
probel(II) ~ {probe2(II) and probe3(II)}. 

For simplicity we assume that systematic differences between channels have already been 
eliminated through normalisation, although this is not essential. 
10 We now imagine a factor (x) that will transform the signal of probe2 into a value directly 

comparable to probel. Likewise we imagine factor (y) for probe3. As long as we are not facing 
saturation in the hybridisations, the assumption of a linear relationship between absolute probe 
signals is reasonable. 

The introduction of variables x & y will give the following equations: 
15 probel(I) = (x)probe2(I) + (y)probe3(I). 
probe 1(11) = (x)probe2(II) + (y)probe3(II). 

Since all signals are measurable, the above is two linear equations with two unknown variables, 
that can easily be solved. Further the ratio between (x)probe2(I) & (y)probe3(I) will provide the 
ratio between splice form A and B in channel 1. Similarly, the ratio of (x)probe2(II) to 

20 (y)probe3(II) is used for channel II. 

Data normalization is not required for this method. 

In the above equations, probel denotes all probes that will hybridize to both spliceforms, 
probe2 denotes probes that specifically will hybridize to spliceform A but not B, and probe3 
denotes probes that will specifically hybridize to spliceform B but not A. 

25 In the case where two spliceforms consist of gene78 with two different inserts middle 

exons (INS3& INS4), probes can be grouped as in Table 20 (only LNA 40mers are considered 
here): 
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Table 20 


Probes that will hybridize to 
both constructs 

Probes that will hybridize to 
INS3 constructs only 

Probes that will hybridize to 
INS4 constructs only 

Gene78.01a_40_LNA3 

Gene78.INS3_40_LNA3 

Gene78. 1 NS4_40_LN A3 

Gene78.01b_40_LNA3 

Gene78.m01 INS3_40_LNA3 

Gene78.m01 INS4_40_LNA3 

Gene78.03a_40_LNA3 

Gene78.mlNS303_40_LNA3 

Gene78.mlNS403_40_LNA3 

Gene78.03b_40_LNA3 




The equations can be solved with any combinations of one representative from each 
5 probe group. This gives a total of 48 (4x3x3) possible ways of solving the equations. The 

estimated quantities of the constructs are given as the average of all possible solutions (equations 
yielding non-positive solutions are ignored). This was done for all comparative hybridizations. 
Note that when comparing with gene78 with no insert, only 12 equations are possible (The, since 
the artificial splice variant construct with no insert has only one specific probe). The results from 
10 analysis of the microarray hybridization data from the RNA pools spiked with different splice 
isoforms at different ratios and concentrations are shown in Figures 39 and 40. 
RESULTS 

Figure 39 shows detection of alternatively spliced mRNAs using LNA-substituted 50-mer 
oligonucleotide capture probes in a bar diagram format. Figure 40 shows detection of 

1 5 alternatively spliced mRNAs using LNA-substituted 40-mer oligonucleotide capture probes. 
Both 50-mer and 40-mer LNA-DNA mixmer substituted oligonucleotide capture probes, 
substituted with an LNA nucleotide at every third nucleotide position, were able to provide 
highly accurate measurements for fold-changes in the expression of three homologous, 
alternatively spliced mRNA variants in the concentration range of 1000 ppm to 10 ppm. The 

20 quantification of the splice isoforms was carried out using a set of both internal, exon-specific 
probes and merged, splice junction specific probes, printed onto microarrays and hybridized with 
complex cDNA target pools spiked with different cloned artificial splice isoforms in which the 
middle exon was either alternatively skipped or excluded completely resulting in the three 
different splice isoforms; 01-INS3-03, 01-INS4-03 and 01-03. 
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Example 53. Expression Profiling of Toxicological responses in Caenorhabditis elegans 
using LNA Oligonucleotide Microarrays and beta-naphthoflavone and primaquine as 
model compounds. 

The present patent example demonstrates the use of the Exiqon C elegans LNA tox 
5 oligoarray in gene expression profiling experiments in the nematode Caenorhabditis elegans. 
The C. elegans tox oligoarray will monitor the expression of a selection of 1 10 genes relevant for 
general stress response and for the metabolism of toxic compounds. Two different capture probes 
for each of these target genes have been designed, and included on the LNA tox array. In 
addition, the C elegans LNA tox oligoarray contains capture probes providing control for cDNA 
10 synthesis efficiency and the developmental stage of the nematode. Capture probes for 

constitutively expressed genes for data set normalization is also included on the C. elegans LNA 
tox oligoarray. 

A. Cultivation of C elegans worms 

For all cultures the sample is divided into two, and one half of the sample is used as the 
15 control, the other as the treated sample. Worm samples are harvested and sucrose cleaned by 
standard methods. For heat shock treatment: the heat shock sample is added to S-media 
preheated to 33°C in a 1 L flask suspended in a water bath at 33^C, the other sample is added to 
a 1 L flask with S-media at 25°C. Both samples are shaken at approximately 100 rpm. for an 
hour. For p-naphthoflavone and primaquine treatment: 0.5 mL of 5 mg/mL P-naphthoflavone in 
20 DMSO or 0.5 mL of 20 mg/mL primaquine in DMSO were added to each 500 mL volume of S- 
media culture after 28 hours of growth from LL At the same time 0.5 mL of DMSO was added 
to the control. Incubation was for 24 hours. Samples are then harvested by centrifugation at 
3000xg suspended in RNALa/er^^ (Ambion) and immediately frozen in liquid nitrogen. 

B. RNA extraction 

25 RNA was extracted from the worm samples using the FastRNA® Kit, GREEN (Q-BIO) 

essentially according to the suppliers' instructions. 

C. Design and synthesis of the LNA capture probes 

Capture probes were designed using an in-house developed software. Regions with 
unique mRNA sequence of the selected target genes were identified. The optimal 50-mer 
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oligonucleotide sequences with respect to Tm, self-complementarity and secondary structure 
were selected. LNA modifications were incorporated to increase affinity and specificity. 

D. Printing of the LNA microarrays 

The microarrays were printed on knmobilizer™ MicroArray Slides (Exiqon, Denmark) 
5 using the MicroGrid II from Biorobotics (UK). The arrays were printed with a 10 ^iM capture 
probe solution. Two replicas of the capture probes were printed on each slide. 

E. Synthesis of fluorochrome labelled first strand cDNA from total RNA 

1 5 |ig of C elegans total RNA was combined with 5 |Lig oligo dT primer (T20VN) in an 
RNase free, pre-siliconized 1 .5 mL tube, and the final volume was adjusted with DEPC-water to 

10 14.5 jiL. The reaction mixture was heated at +70°C for 10 min., quenched on ice 5 min., spin 20 
seconds, followed by addition of 1 jiL SUPERase-In™ (20U/|LiL, Ambion, USA), 6 |lIL SxRTase 
buffer (Invitrogen, USA), 3 ^iL 0.1 M DTT (Invitrogen, USA), 1.5 \iL dNTP (20mM dATP, 
dGTP, dTTP; 4 mM dCTP in DEPC-water, Amersham Biosciences, USA), and 3 |iL Cy3™- 
dCTP or Cy5™-dCTP (Amersham Biosciences, USA). First strand cDNA synthesis was carried 

15 out by adding 1 \xL of Superscript™ II (Invitrogen, 200 U/mL), mixing and incubating the 
reaction mixture for 1 hour at 42°C. An additional 1 of Superscript™ II was added and the 
cDNA synthesis reaction mixture was incubated for an additional 1 hour at 42^C; the reaction 
was stopped by heating at 70°C for 5 min., and quenching on ice for 2 min. The RNA was 
hydrolyzed by adding 5 jiL of 1 M NaOH, and incubating at 70*^0 for 1 5 min. The samples were 

20 neutralized by adding 5 |xL of 1 M HCl, and purified by adding 450 ^iL IxTE buffer, pH 7.5 to 
the neutralized sample and transferring the samples onto a Microcon-30 concentrator. The 
samples were centrifiiged at 14000xg in a microcentrifuge for ^8 min, the flow-through was 
discarded and the washing step was repeated twice by refilling the filter with 450 ^1 IxTE buffer 
and by spinning for ~12 min. centrifugation was continued until the volume was reduced to 

25 about 5 |iL, and finally the labelled cDNA probe was eluted by inverting the Microcon-30 tube 
and spinning at lOOOxg for 3 min. 

F. Synthesis of fluorochrome labelled cRNA fi-om total RNA 

First and second strand cDNA syntheses were made using the MessageAmp^"^ aRNA Kit 
(Ambion) according to suppliers' instructions. Five microgram of C elegans total RNA was 
30 used as template for cDNA syntheses. Syntheses of fluorescent cRNA were made according to 
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the MessageAmp™ aRNA Kit (Ambion) protocol with minor modifications. Cy3^'^-UTP or 
CyS'^'^-UTP (6 ^1 of a 5 mM solution Amersham Biosciences, USA) replaced biotin-CTP. The 
final concentration of ATP, CTP, and GTP was 7.5 mM whereas the concentration of UTP was 
reduced to 4.9 mM. 
5 G. Hybridization with fluorochrome-labelled cDNA or cRNA 

The arrays were hybridized overnight using the following protocol. The Cy3™ and 
Cy5™-labelled cDNA or cRNA samples were combined in one tube followed by addition of 3 
|ilL 20xSSC (3xSSC final), 0.5 iiL 1 M HEPES, pH 7,0 (25 mM final), 25 |ig yeast tRNA (1.25 
\xg/[iL final), 0.6 |iiL 10% SDS (0.3% final), and DEPC-treated water to 20 ^iL final volume. The 

10 labelled cDNA target sample was filtered in a Millipore 0.22 micron spin column according to 
the manufacturer's instructions (Millipore, USA), and the probe was denatured by incubating the 
reaction at 100°C for 2 min. The sample was cooled at 20-25°C for 5 min. by spinning at max 
speed in a microcentrifiige. A LifterSlip (Erie Scientific Company, USA) was carefiilly placed 
on top of the microarray spotted on Immobilizer™ MicroArray Slide and the hybridization 

1 5 mixture was applied to the array from the side. An aliquot of 30 |LiL of 3xSSC was added to both 
ends of the hybridization chamber, and the Immobilizer™ MicroArray Slide was placed in the 
hybridization chamber. The chamber was sealed watertight and incubated at 65°C for 16-18 
hours submerged in a water bath. After hybridisation, the slide was removed carefiilly fi-om the 
hybridization chamber and washed using the following protocol. The Lifterslip coverslip was 

20 washed off in 2xSSC, pH 7.0 containing 0.1% SDS at room temperature for 1 min., followed by 
washing of the microarrays subsequently in 1 .OxSSC, pH 7.0 at room temperature for 1 min, and 
then in 0.2xSSC, pH 7.0 at room temperature for 1 min. Finally the slides were washed for 5 
seconds in O.OSxSSC, pH 7.0. The slides were then dried by centrifugation in a swinging bucket 
rotor at approximately 200 G for 2 min. The slide is now ready for scanning. 

25 H. Data analysis. 

Following washing and drying, the slides were scanned using a ScanArray 4000XL 
scanner (Perkin-Elmer Life Sciences, USA), and the array data were processed using the 
GenePix™ Pro 4.0 software package (Axon, USA). The data in each image was normalized so 
that the ratio of means of all of the features is equal to 1 . 

30 
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RESULTS 

Use of LNA-modified oligonucleotide capture probes in Exiqons C. elegans LNA tox 
oligoarray clearly allows the identification of distinct expression profiles for C. elegans genes 
relevant for general stress response and for the metabolism of toxic compounds. 

Table 1 7. Expression profiling using LNA Oligonucleotide Microarrays. 

Log2 transformed fold of changes for selected genes in the two expression profiling experiments 

hybridised with cRNA target. 


Gene name 

Tox compound 


Primaquine 

beta-naplithoflavone 

ABC_C34G6.4 


1.01 

ABC_F57C12.4 

-1.11 


CYP_C03G6.15 


2.35 

CYP_C06B3.3 


2.47 

CYP_C49G7.8 


2.40 

CYP_F14F7.2 

-1.24 

-1.03 

CYP_K07C6.5 


2.68 

CYP_K09D9.2 


2.14 

DC_W05G11.3 

1.16 , 


ER_26S 

-1.09 

-1.01 

HSP_C47E8.5 


1.17 

HSP_F26D10.3 


1.05 

HSP_F43D9.4 


1.27 

NAP_D2096.8 


1.14 

PPGB_F13D12.6 

1.08 

1.21 

RAD_Y116A8C.133 


1.13 

RPL_K11H12.2 • 

1.15 

1.42 

Ubi_F25B5.4 


1.37 
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Table 18. LNA-modified oligonucleotide capture probes. LNA modifications are depicted by 
uppercase letters in the sequence, mC denotes LNA methyl cytosine. (SEQ ID NOs: ) 

Sequence 

TgcmCatTgcAcgGgcActTgtTcgAtcTccTtcTgtTttActTttGgaTg 
TcaTtcTagGatTgcmCagAtgGttAtgAtamCtcAtgTcgGagAgaAagGa 
mCcaAtgTtgTttAatTggTtgTaaTgtmCttGatGacmCtgmCatAatmCatAt 
mCacAagAtcmCtgTgtTgtTctmCcgGaamCaaTgaAaaTgaActTagAtcmCa 
TacTtgTtcTcgAcaAagGttGtgTagmCcgAgtTtgAcamCtcmCgaAgaAa 
TgaActTggAtcmCctTctTtgmCatTtaGcgAtgAtcAaaTttGggAagmCg 
TcaTtaAttTtgTgtAgcTttmCttTctmCgaTttTtgmCacGatmCttTccmCc 
AggGtgmCctActAcaAacTgamCccAaaAgcAgaTgamCcgAgaAgaAatAa 
AttGaaAgcGacGcgGaaAgtGccAtgTatTtcTaaTttTgtTttmCttTa 
TtgTcaGcaTatmCaaGagTagAtaTggAagTggAtamCacTctGctAatmCc 
mCacmCttAttGcgTtcAatTttTgtTtcmCacmCtamCtamCtamCgaAtamCgtTg 
TcamCaaGggAgaGagTctGcgGtcGgtGctGgcGttmCgaGaaAatAtaAc 
mCatGcaTccmCgamCgaGaaGaaGtamCtcAttTtgGagTtaTctGgcGaaTt 
GacmCatGctmCcgGtcGtcAtgmCaaAtcGacTtcTaaAttGctTctGatTa 
TtgmCatGctGttAaaAccTatmCgtGtamCaaTatTgcmCtgTatAttmCccmCt 
TggmCacAgcTtaAtaAcaAatTggAaaGtcGagGatTagTcgGtgTtgAa 
GacAcamCgcAaaGgaTatGgaTgtTgtTgaGctGctGacTgaAgtmCaaTa 
AgcAcgAaamCtcTgcmCgtmCtaAaaTtcActmCgtGatTcaTtgmCccAatTg 
AtgGtcAtamCtcTaaAatGggmCagAacTtcAacmCaaAtcAttmCtcGtcAg 
AacmCcgAgcTtgmCcgmCaaAgtGcaAgaAaaTtaTagAacGaaTgaAacAg 
GgaTggGtcGagmCgtGagAccTacTacTaaAgaAcaGctTgtGaaTctTt 
mCaamCgtTctmCgaTtcmCtamCggAcaAgaAtgGacmCtaTgcnriCaamCagA 
CEATPase_B0365.3_u386_LNA3 aaGa 
CEATPase_C1 7H1 2.14_u356_LNA 

3 TgcTcgTtaTccAgcTatTttGaaGggActTgtmCatGcaAggActTctTc 
CEATPase_C17H12.14_u89_LNA3 mCcgTttAgaGctTatTgcTaamCcaGatTgtmCccAcaAgtmCagAacAgcTc 
CEATPase_F55F3.3_u215_LNA3 TgamCggAcgmCtamCtamCccAtaTgtAttTgtTccAtcTtamCcaGcaAccAa 

AgcTacTtcAttmCgamCaaGgaAcaTctmCggAaaAgtmCaaGtamCatmCccG 
CEATPase_F55F3.3_u275_LNA3 g 

CEATPase_Y49A3A.2_u103_LNA3 AaaTtcAagGatmCcaGttGccGatGgtGaaGccAagAttmCgcAagGatTa 

mCgaTcgTttmCtgmCccAttmCtamCaaGacTgtmCggTatGctmCaaGaaTatG 
CEATPase_Y49A3A.2_u272_LNA3 a 

CECALR_Y38A10A.5_u238_LNA3 TcaGgaAcgAtcTttGacAacAttAtcAtcAccGacTctGttGagGagGc 
CECALR_Y38A10A.5_u296_LNA3 TgaActmCtamCtcTtaTgaAagmCtgGggAgcmCatmCggAttmCgaTttGtgGc 
CECAT_Y54G11A.5b_u137_LNA3 GaamCttTgcAggGccGctmCggGgaAtgTcaTgaTttmCatTatTaaGggAa 
CECAT_Y54G1 1 A.5b_u1 89_LNA3 GtcAatTctGggAgaAggTgtTggAtamCcgGggmCtcGggAgaGaaTgtGc 


Oligo Name 

CEABC_,C34G6.4_u293_LNA3 

CEABC_C34G6.4_u375_LNA3 

CEABC_F57C12.4_u15_LNA3 

CEABC_F57C1 2.4_u480_LNA3 

CEABC_F57C12.5_u1 1 1_LNA3 

CEABC_F57C12.5_u444_LNA3 

CEABC_K08E7.9_d8_LNA3 

CEABC__K08E7.9__u51_LNA3 

CEABC_Y39D8C. 1 _u37_LNA3 

CEABC_Y39D8C.1_u422_LNA3 

CEADH_H24K24.3a_d3_LNA3 

CEADH_H24K24.3a_u50_LNA3 

CEAPEX_R09B3.1_u191_LNA3 

CEAPEX_R09B3.1_u37_LNA3 

CEAPO_C35D1 0.9_u1 5_LNA3 

CEAPO_C35D1 0.9_u609_LNA3 

CEAPO.C48D1 .2_u176_LNA3 

CEAPO_C48D1 .2_u23_LNA3 

CEAPO_F20C5.1_u453_LNA3 

CEAPO_F20C5.1_u96_LNA3 

CEATPase B0365.3 u31 LNA3 
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Oligo Name 

CECC„C03D6.3_u275_LNA3 

CECC„C03D6.3_u430_LNA3 

CECC_C07G2.3_d9_LNA3 

CECC_C07G2.3_u44_LNA3 

CECC_Y46G5A.2_^u331_LNA3 

CECC_Y46G5A.2_u385_LNA3 

CECoA_C29F3.1_u316_LNA3 

CEGoA_C29F3.1_u392_LNA3 

CECoA_F08A8.4_u 1 094_LN A3 

CECoA_F08A8.4_u1260_LNA3 

CECoA_F59F4.1_u109_LNA3 

CECoA_F59F4. 1 _u424_LNA3 

CECoA_Y25C1 A.13_u1 15_LNA3 

CECoA_Y25C1A.13_u451_LNA3 

CECOL_C27H5.5_u493_LNA3 

CECOL_C27H5.5_u680_LNA3 

CECOQ_ZC395.2_u1 99_LNA3 

CECOQ_ZC395.2_u400_LNA3 

CECRYZ_F39B2.3_u171_LNA3 

CECRYZ_F39B2.3_u222_LNA3 

CECyclin_R02F2. 1 a_u24_LNA3 
CECyclin_R02F2.1a_u312_LNA3 
CECyclin_ZC168.4_u203_LNA3 
CECyclin_ZC1 68.4_u273_LNA3 
CECYP_B021 3. 1 5_u1 33_LNA3 
CECYP_B021 3.1 5__u202_LNA3 
CECYP_B0304.3_u38.LNA3 
CECYP_B0304.3_u89_LNA3 
CECYP_C03G6. 1 4_u706_LN A3 
CECYP_C03G6. 1 4_u768_LNA3 
CECYP_C03G6.1 5_d9_LNA3 
CECYP_C03G6.1 5_u148_LNA3 
CECYP_C06B3.3_u102_LNA3 
CECYP_C06B3.3_u474_LNA3 
CECYP_C1 2D5.7_u399_LNA3 
CECYP C12D5-7 u65 LNA3 


Sequence 

AtgTaaAgaAggAatGctTccmCgaAtgGatTggAtaTttAttTgtmCcaGa 

GgamCcgAaaTttGtgmCagmCatGtcGgamCacGaaAttGatGgtmCtcAttTt 

mCagAcamCgaAggTtamCgaTagAtaAccAtcTctmCaaAgtmCtaTcgAccTc 

mCgamCgaTgtGcgTgtTccTgamCgaTgaAagAatGggAtaTtaAgaAaamCc 

TtgTgcTccAtcGctGctmCcgmCttAcaGacTtgAcaAcgmCtcAccTttGc 

AatGagmCggTtgTgcmCgtGtgAcgTcamCttmCgtmCacAgtGttGctmCtamCt 

AaaTtgAcamCcaAtcAaaTctGtcTcaTctmCctGagGacmCgtmCaamCttmCg 

AatmCttTgtGtamCggAgaTggGgcAaaAggmCagmCaaGaaAgtAaamCcaAg 

AggAcaAggGgcActActGgcAcaGgcTttGatTatTgcAgtGagAtaTt 

TtaAtgGagGtgAcaAtgGgtTccTtgGatTcgAtaAatTocGagTgcmCc 

GctmCttmCtcmCaglggGctmCaaAatAgtrnCaamCtcAacAgaTcgGaaGttmC 

t 

AaaGctTcgAgaTggmCacGttmCgtmCtgTatmCtcGtgAagAacTtaTtgmCa 

GatTcgmCtgAacTttAtcAagAcgTggAatAtgAgcmCagmCtcmCtgTcgAc 

GatmCttAtcAccGcgTgcGatAttmCgaGtaGctTcamCagGatGcgAttTt 

GgaAagGaaGgaTccAttmCtcAgcTctGcamCttmCcamCcaTcaGagmCcaTg 

TggAtamCaaGgaGggAtcTggmCagTggTggAtcTggAagTggTggAtaTg 

TtgAaaGaamCtcmCttGccGacGatmCctGaaAcamCacAaaGaaTtgmCtgAa 

AtgTggGatGagGagAaaGaamCatTtaGatAcaAtgGaaAgaTtaGctGc 

AggmCtgAgcTctTggActTtgGcaTcaAcaTtgTctmCatTctTgaAggAa 

TtaTggTtamCagAagGagmCtgTttAcgGtgTagmCatTggGaaTgtmCttmCc 

mCacTtcAacmCaamCtcmCgtGttAatmCaaGcaAgcmCgcmCacmCatmCta 

AtgAg 

TctmCatTgcTcgTcgAggmCtamCcaAcaAacActGgcAatAccmCaaTtaAt 

TaaGaaAgtmCatTgaGgaTgcTgtmCgcTttGctmCgcmCgaAgtmCtcGtaTa 

AagTtcAtcmCtgTtgAcgGaaTcgAggmCggAgaAtgmCtgTatmCggTcaTt 

AcaGgaAatAtgAttTtgGatTtcGatTttGaaTcgGttGgtGctGccmCc 

GctGagmCtgTatTtgGctAgtGaaAtgTgtGttTttGatActTtaAatGa 

AcgAggTttGgaTcamCaaTcaGaaTtcTgtGaaAtaAgcGttTttTggGa 

AgtTctmCggTctAacAgtGtcTccmCgtTgaAtaTtcTtgTaaAatmCacAc 

AtgAccActmCaaAatActGctAaaAgaTttGcaGcgGcaGaaGccGttAa 

TtgAtaTggmCtgTacmCtgTatGgtTttTgaGgamCgtTttTtaGgaGtcGa 

AttTatTcaTtcAtcmCatGtaAacTgtAtaTttTgaAttTgtGttGtaAa 

GccAaaGcaGaaTtgTatTtgAtcTtcGgtAacmCttmCtcmCttmCgcTacAa 

AttTtgAatmCttmCtgGgaAaaTgcmCatmCcamCtcGagAaamCcgTtcmCgtTt 

mCtaAcgGagGatmCtcGccAatTatmCttTgaGagAcaAaamCtgAaamCtcmCt 

AtcTagTccmCaaTgaAtcTccmCacAtgmCtgTtamCtcGtgAtgTtcAacTc 

TttTgcTttmCatmCgcAaaAgcTcaAgaTtamCacAtgTcaGgtmCaaGccAa 
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Oligo Name 

CECYP_C45H4.1 7_u27_LNA3 

CECYP_C45H4.1 7_u598_LNA3 

CECYP_C45H4.2_u110.LNA3 

CECYP_C45H4.2_u429_LNA3 

CECYP_C49C8.4_u363_LNA3 

CECYP_C49C8.4_u883_LNA3 

CECYP_C49G7.8_d6_LNA3 

CECYP_C49G7.8_u795_LNA3 

CECYP_F01 D5.9_u374_LNA3 

CECYP_F01 D5.9_u46_LNA3 

CECYP_F08F3.7_u25_LNA3 

CECYP_F08F3.7_u401_LNA3 

CECYP_F14F7.2_u397_LNA3 

CECYP_F14F7.2_u68_LNA3 

CECYP_F42A9.5_u435_LNA3 

CECYP_F42A9.5_u55_LNA3 

CECYP_K07C6.3_u3_LNA3 

CECYP_K07C6.3_u354_LNA3 

CECYP_K07C6.4_u1 18_LNA3 

CECYP_K07C6.4_u87_LNA3 
CECYP_K07C6.5_u7_LNA3 
CECYP_K07C6.5_u99_LNA3 
CECYP_K09A1 1 .3_u362_LNA3 

CECYP_K09A1 1 .3_u48_LNA3 
CECYP_K09A1 1 .4_u238_LNA3 
CECYP_K09A1 1 .4_u68_LNA3 
CECYP_K09D9.2_u1 51_LNA3 
CECYP_K09D9.2„u866_LNA3 
CECYP_T1 0B9. 1 0_u4 1 0_LNA 
CECYP_T1 0B9, 1 0_u56_LNA 
CECYP_T1 0B9.7_u1 02_LNA3 
CECYP_T1 0B9.7_u267_LNA3 
CECYP_T19B10.1_u100_LNA3 
CECYP_T19B10.1_u319_LNA3 
CECYP_Y49C4A.9_u 1 2 1_LNA3 
CECYP_Y49C4A.9_u4 1 3_LNA3 


Sequence 

mCcgmCgamCttTaaAgaGaaGatmCatAaaTttGcaTtgTttTttGttTgtAt 

mCgaGggTgaTtcGgaGacTttmCagTaaTgtmCcaActTtcAaaTgtTtgmCa 

TagAtamCaaGatAcaTccmCtcAaaAgaAggmCctAccGtcAatGgcmCaaAg 

TcaAcgmCgtmCtaTaaAtgAatmCacAacGagGtaTcaAcaTtcTccmCccTg 

AtgmCtgAtgTtgAaaTtgmCtgGctAccGtaTtcmCaaAagAtamCtgTaaTc 

AtgAatmCcaTggmCttGgamCatmCtcmCcgTttTtcAagGgaTatAaaAatGt 

AtgmCaamCgaAttAgtGaaAaaTtcAtcmCtgGaaTaaAaaAtaAttmCtaAa 

AtcGctAcgAcaAtcTttmCcgAtgmCctTcgAagTttmCgaAagmCttTctmCt 

GagGtcGgtGgaGgaGgaAgtGgaAatTgamCggmCaaAatmCctGccmCaaGg 

mCccTctTtgGgaTttmCcamCtcAagTttActGttmCggmCagmCagTgaTatAa 

GagTtgGttmCcamCagAatGctTagGacGttTaaAttmCgtmCacAaamCttTt 

mCaaTatGgtTccmCatTttAgcAacTcaTatGaamCacAgaAgaTgtmCctTg 

GaaAaaGgcGtcGacAttTtaTgtGacAcgTggAcamCttmCacTatGacAa 

TaaTtgAatTacGggTctTttGtamCatAttAatTttAgtAtamCttTgtGa 

AtaTcaAtgmCaamCtaTtaAtgAatmCacAacGtcTtgmCcaAtcTtcTccmCg 

GgaGtgActAtgAaaGcaAagAgtTacmCgaTtgAaamCtgAaaGacAgamCa 

AatmCttTaaTgaTaaTttAtgGgaTctGtaTttmCtcTttmCtgTcaAtaAa 

AtgAgcmCcamCaaAtgTaaAagGatAcgAgaTtgAttmCggGaamCagTcaTg 

AtcmCtgmCgaTatGacAttAagmCcamCatGgtTctGaamCctTcaAcaGaaGa 

mCtgAacmCtlmCaamCagAagAtaAacTtcmCgtAtaGcgmCtgGaaAaamCtc 

met 

AttTaaAggAatTcamCagmCtcAaaAaaTaaTaamCtamCcgGttmCagAgaTt 
AatTtgAgcmCacAtgGcaAgtTatmCaamCagAggAgamCaaTgcmCgtAcaGt 
TgamCatTctActTaaAggGaaGaaAatAccAacTggTacmCctTgtAttTg 
TcamCcamCaaAgcmCatAcaTatGcgAgcTagTtcmCtcAggmCtgmCttAaam 

Cc 

TtcGacAaaActAttTtgGaaAgaAcaAtcmCcaTtcAgtGtcGgcAaamCg 

TctGacAacAaaGccAtamCacGtgmCcgActAatTccAcaAtcAgcTagAa 

TtgGcaAaaGcaGaaTtgTatTtaAtcTttGgaAacmCtcmCttmCttmCgcTa 

TgaAtcTttmCaaActTatmCacTccTttTaaTacTacmCgtTccTgtTtgGa 

AttGagAttGtaTccAttGgcGtcTctTgtTcamCaaTcgAaaAtgTctmCa 

AacTgcTacTatTgcGccAtcAagTgtGctGctmCaaActTaaAtcmCagGt 

TtgAgamCagGaaAtaAgamCtaGaaTtcmCttTgaAacTggTggGaaGtgmCt 

AagAtgTcaAagAatTcaAgcmCagAacGatGgtmCcamCcgAcgAgcmCatTa 

AttGaamCcaActmCtgAaaTatAatGacAcaAaamCcaTgtmCtgGaaGtgGt 

GgcAatGtgAcaAtaTctmCcaAtgGttmCttmCacAgcAatmCatmCacGtgTt 

mCtaTtcAatmCgaTatTttAtcAcamCcaTccAgtGctGgamCctmCcaTcaTt 

GtcTcaGagAtgTgtAaaTttActTccmCtgmCaaTttGttTcamCgcAacTa 
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Oligo Name 

CECYP_ZK1 77.5_u394_LNA3 
CECYP_ZK1 77.5_u445_LNA3 
CEDAO_C47A1 0.5_d9_LNA3 
CEDAO„C47A1 0.5_u269_LNA3 
CEDC_C01A2.3_u373_LNA3 
CEDC_C01A2.3_u96_LNA4 
CEDC_C34F6.1_u301_LNA3 
CEDC_C34F6.1_u450_LNA3 
CEDC_F33D11.3_u126_LNA3 
CEDC_F33D1 1 .3_u14_LNA3 

CEDC_F46E1 0.2.u392_LNA3 
CEDC_F46E1 0.2_u54_LNA3 
CEDC_F56G4.2_u382_LNA3 

CEDC_F56G4.2_u82_LNA3 
CEDC_M162.2_u103_LNA3 
CEDC_M162.2_u480_LNA3 
CEDC_R10E4.1 1_u274_LNA3 
CEDC_R10E4.1 1_u397_LNA3 
CEDC_T04C9.1_u321_LNA3 
CEDC.T04C9. 1_u64_LNA3 

CEDC_W02A2.3_u32_LNA3 

CEDC_W02A2.3_u374_LNA3 

CEDC_W05G1 1 .3_u153_LNA3 

CEDC_W05G1 1 .3_u51_LNA3 

CEDC_ZK863.5_u256_LNA3 

CEDC„ZK863.5,u324_LNA3 

CEEPHX_Y55B1BR.4_u161_LNA3 

CEEPHX_Y55B1 BR.4_u93_LNA3 

CEER_18S_u388_LNA3 

CEER_18S_u82_LNA3 

CEER_26S_u342_LNA3 

CEER_26S_u38_LNA3 

CEFOXO_R13H8.1b_u331_LNA3 

CEFOXO R13H8.1b u393 LNA3 


Sequence 

TtcmCgaAtgTttmCcaAttGggActGaaGttTcaAgaGtcAccmCagAaaAa 

GatmCcaGcaTctTccAagmCttAcaTtcmCtcmCgtGctTgtAtcAagGaaAc 

TttGaaAacmCtgTttTatTatTaaAatAgaTaaTtgAttAgtTctGtamCg 

AtamCgtTgcActGcaTccGgcTatGagGgaGccAaaAatmCttAggGgaGt 

GcamCttmCcaTtcAtcTctGcaGctActAtgGctTtgGtgAcaAaaGttGg 

mCcgTccAaaAgaAtgmCcaTctmCacAagTctTgaAatmCttAtaAagGtaGt 

GagGgaTcaAcaGtaAccTcgTgcGgtAttGacAagGgaTgtmCcgGaaGg 

GatGgtTctTcgAtcGcaAacAaaAcaGatGtgmCtcmCatTtamCatAcgGa 

AtgGagAaaAtgGatmCtgAtgGagTtgmCagGaaGtgAtgGagmCtcmCagGa 

TgaAtcTccAtaAatTatTcaAtgTttmCcaAatAttTaaTttAtcAatTg 

GctmCaamCacGgtAggAtcmCtaTggAacmCgtmCggAggAgcAggmCclmCg 


mCgtGacAacmCtcTtaTttAttTctGtaAaamCtgAttmCgcmCaaActTttGt 
GaaGctTtcAaamCcaAatGagTtcmCttmCccGgaAtcmCcaAagAatAccAa 

AcaAtgAaaAgaGagGatGgaAagGaaAtcGaaGtcTctGttmCttGacGa 

GatGagGtamCatAacTttGtgTgcAgtTatAggmCcaTctAcaGtamCctGc 

TtcmCatmCatmCacTaamCcgAttGtcmCtgAcaTtgAtgGccAaamCcaGggAa 

TcamCatTatmCgaAcaAgtActAgtAagmCatGctGtgAtgGagTgcmCgcTa 

mCacGgaGatmCacGacAtcAaaGcgGatTgcTtaGagTgtGgaAacmCgtmCt 

ActAtcTacGtgGcamCgtTggActmCatmCatmCgaTggGaamCgamCgtAtaAg 

TctmCtgGccAgtTcamCttTgtGatmCaaTctmCagAttmCgtmCcamCacAagAt 

mCtamCttmCcgmCaaGaaGgcmCcgTcgTttmCtaAtcGatmCgaAcaTctmCac 

Ac 

AtgGatGatmCgamCccActTgcmCacTgamCccAcaAtcmCcgmCacTcamCta 
mCc 

AagAcgGagAggmCtgGagAgaAcgGtamCcgAtgGagAgcmCagGaamCtgAt 

mCcamCccAggAggAggGatAcaAgaGaaGaaAgtAcaGatTctmCcaActAa 

AgtTtcAcamCttmCttTttGccGttTtgGttmCccGttAtcAatmCcaTtgAt 

mCttTtaTatTctmCatmCaaTttGttTccTacTtgGtcAgcTgaGgaTcgTt 

TtcGgcAcaAatGgaGcaAaaGtaTcgTggTtaTtgTgaTgcGatTatTc 

mCtamCtaTgaAtgAgcTcamCtgGacTcaTttAtcAacTcgAgtmCaaAagmCc 

GttGgcGaaTctTcgGgtTcgTatAacTtcTtaGagGgaTaaGcgGtgTt 

GaamCtgAttmCgaGaaGagTggGgamCtgTcgmCttmCgaGgtTtaAcgActTc 

TgtTatTgcGaaAgtAatmCctGctTagTacGagAggAacAgcGggTtcAa 

TgcAtamCgamCttGgtmCtcTtgGtcAagGtgTtgTatTcaGtaGagmCagTc 

TgtGctmCagAatmCcamCttmCttmCgaAatmCcaAttGtgmCcaAgcActAacTt 

TtaAgamCggAacmCaaTtgmCtcmCacmCacmCatmCatAccAcgAgtTgaAca 

Gt 


243 


Oligo Name 

CEGAPDH_K10B3.7_u21_LNA3 
CEGAPDH_K10B3.7_u727_LNA3 
CEGBA^F1 1 E6.1a_u232_LNA3 
CEGBA_F11E6.1a_u451_LNA3 
CEGLU^C02A12.1_u264_LNA3 
CEGLU_C02A12.1_u55_LNA3 
CEGLU_C46F1 1 .2_u271_LNA3 
CEGLU_C46F1 1 .2_u45_LNA3 
CEGLU_F26E4.12_u109_LNA3 
CEGLU_F26E4 . 1 2_u480_LNA3 
CEGLU_R07B1 .4_u166_LNA3 
CEGLU_R07B1 .4_u38_LNA3 
CEGLU_T09A1 2.2_u220_LNA3 
CEGLU_T09A1 2.2_u335_LNA3 
CEGLU_T28A1 1.1 1_u299_LNA3 
CEGLU_T28A1 1.1 1_u54__LNA3 
CEGPD_B0035.5_u256_LNA3 
CEGPD_B0035.5_u478_LNA3 

CEHSP_C09B8.6,d8_LNA3 

CEHSP_C09B8.6_u286_LNA3 

CEHSP_C12C8.1_u127_LNA3 

CEHSP_C1 2C8. 1_u1 531_LNA3 

CEHSP_C47E8.5_u310_LNA3 

CEHSP_C47E8.5_u361_LNA3 

CEHSP_F26D1 0.3_u276_LNA3 

CEHSP_F26D10.3_u397_LNA3 

CEHSP_F43D9.4_u169_LNA3 

CEHSP_F43D9.4_u275_LNA3 

CEHSP_F44E5.4/5_u123_LNA3 
CEHSP_F44E5.4/5_u380_LNA3 
CEHSP_F52E1.7_u175_LNA3 
CEHSP_F52E1 .7_u448_LNA3 
CEHSP_F54D5.8_u252_LNA3 
CEHSP F54D5.8 u318 LNA3 


Sequence 

AcaTtgmCtamCcaAggmCctAagmCcgmCttmCaaAttmCtcTaaGtcTgaAatG 
a 

GttGagTccAccGgaGtcTtcAccAccAtcGagAagGccAatGctmCacTt 

AgtAaaTtcmCttmCcamCgtGgaTctActmCgtGtgTtcAcaAagAtcGagGg 

GgtmCcaAtaAtgGgaGacTggTtcmCgcGcaGaaAgtTatGcaGatGatAt 

AgaAaamCttmCgtTggAccmCtgmCtaAggAgaAgtAttTcaAgcTtcTgaGc 

GagmCacmCcgAagmCtcAagmCcaTatTtgGaaAcaAgamCcaTacTctTcaAa 

GttAccmCtcTacAaaTctmCgcTtcAatmCcaAtgTtgTtcGcaGtcAccAa 

mCcgAagAgcTcgTtamCtaTgcGagGagGtgTgaAgcmCggAatAatTttTt 

AagTtcTtgGttGgamCgcGatGggAaaAttAtcAagAgaTttGgamCcaAc 

AcgAttTcaAcgTcaAaaAtgmCtaAtgGtgAtgAcgTgtmCacTttmCggAt 

AccTggGttGatGttTttGcgGctGaaAgtTtcTccAagmCtcAttGatTa 

GaaGtamCgtmCtcmCcaAagAaaAgcTacmCccAgcTtaAggmCatTgcAcaAt 

GcgmCcaGatAtgTatTcaAagAtcGagGtaAatGgtmCagAacActmCatmCc 

AatmCtamCagGgaAaaAggAttTcgAgtTgcmCgcGttTccAtgmCaaTcaAt 

AgaTggmCaaAgaAgcAtamCatAacTgaAacTclTccmCggGgaGctActAc 

TgaAtaAacGggmCcgAacTaaAtcmCatTcgTcaGtgGaaAtgGgaAacAa 

GtcmCgtmCttmCctGatGctTatGaamCgcmCtaTttmCtcGaaGtaTtcAtgGg 

TgtGgaAaaGctmCtcAacGagAagAaaGcaGaaGttmCgtAtamCaaTtcAa 

AtaTcgmCcgmCctGctTccTcamCcaAccmCgaAtaAcgmCaamCaaAaamCtt 

Ta 

AagAgcmCcamCtcAtcAagGatGaaAgtGatGgaAagActmCttmCgtmCtcAg 

mCaaGatAttTtaAcaAaaAtgmCatmCaamCaaGaaGccmCaaTcaGgtTccGg 

mCttGggmCatTctGtamCggGatGctGtcAttActGtgmCctGcaTatTttAa 

AagAagmCatmCtcGaaAtcAacmCcaGacmCacGctAtcAtgAagAcamCttmCg 

AtgAaaGctmCaaGctmCttmCgtGatTccTctAclAtgGgaTacAtgGccGc 

TtaAgcAgamCcaTtgAggAcgAgaAgcTcaAggAtaAgaTcaGccmCagAa 

mCgtmCttTccAagGatGacAttGaamCgcAtgGtcAacGaaGctGagAaaTa 

GtcGacTlgGctmCacAtcmCacAccGtcAtcAacAagGaaGgamCagAtgAc 

mCaaTctTgaGggAcamCgtTctmCacmCatTgaGggAcamCcamCgaGgtmCa 

aGa 

TcamCtaAaaTgcAccAatmCtgGacAatmCttmCtgmCttmCtgmCtgGatGcgmC 
t 

TcaTgaAgcTaaAcaAttmCgaAaaGgaAgaTggTgaAcaAcgGgaAcgTg 

AagTatAacmCttmCcaAcaGggGtcmCgtmCcaGaamCaaAtcAagTccGaaTt 

TttAacmCatGgcmCgcAgaTtcTtcGatGacGtcGacTttGatmCgcmCacAt 

GcgTcgAaaAgaTctmCccTgaAgtmCtgmCatTgamCtgGccTtgAtaTtaTg 

AcaTagTctTcgTcaTcaAggAtaAgcmCacAccmCgaAatTcaAgcGagAg 
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Oligo Name 

CEHUS_H26D21 .1_u1 17_LNA3 
CEHUS_H26D21 .1_u478_LNA3 
CEMRE_ZC302.1_u169_LNA3 
CEMRE_ZC302.1_u292__LNA3 
CEMTL_T08G5. 1 0_d 1 27_LNA3 
CEMTL_T08G5. 1 0_u45_LNA3 
CENAP_D2096.8_u356_LNA3 
CENAP_D2096.8_u70_LNA3 
CEPAI_F56D12.5_u241_LNA3 
CEPALF56D12.5_u301_LNA3 
CEPDLC07A1 2.4_u28_LNA3 
CEPDLC07A1 2.4„u433_LNA3 

CEPDLC14B1.1_u119_LNA3 
CEPDI_C14B1.1_u358_LNA3 
CEPGK_T03F1 .3_d9_LNA3 
CEPGK_T03F1 .3_u424_LNA3 
CEPON_E01A2.7_u223_LNA3 
CEPON_E01A2.7_u79_LNA3 

CEPPGB_F1 3D1 2.6_u44_LNA3 
CEPPGB_F13D12.6_u440_LNA3 

CEPPS_T14G10.1_d2_LNA3 

CEPPS_T14G10.1_u240_LNA3 

CEPRDX_R07E5.2_u405_LNA3 

CEPRDX_R07E5.2_u42_LNA3 

CEPYC_D2023.2_u256_LNA3 

CEPYC_D2023.2_u427_LNA3 

CERAD_F1 0G7.4_u1 69_LNA3 

CERAD_F10G7.4_u267_LNA3 
CERAD_F32A1 1 .2_u250_LNA3 
CERAD_F32A1 1 .2_u380_LNA3 
CERAD_T04H1 .4_u274_LNA3 
CERAD_T04H1 .4_u375_LNA3 
CERAD_W06D4.6_u325_LNA3 
CERAD_W06D4.6_u34_LNA3 


Sequence 

TcgmCcaAcamCtcGgamCacGtgmCcaAaaTgaAtaTcaTctmCaaAtcGaaTg 

GtcGaaGttAgaAatmCcaGaaGccGatAttGttlctmCatmCaaAttmCcaAt 

ActActmCgtGgaAgaTccAatAaaGttGttTcaAcgmCgamCaaAtcGatTc 

GgcAgtGaaGatGaaGtgGcaAatTctGatGaaGaaAtgGgaAgcAgtAt 

TtgTcaAcgAccAgaAgcAaaAatTatGggAatmCgcGatAaaAttmCaaGg 

GatGcaAgtGtgmCcaActGcgAatGtgmCtcAggmCtgmCtcAttAatTtgAa 

GacGatAtgTtcGatTtcmCcaGgaGagGacGgtGatGatGtgTcaGacTt 

GacGatAtgltcGatTtcmCcaGgaGagGacGgtGatGatGtgTcaGacTt 

GagGtcGtcGtaAtcmCacAagGctmCcaAgaAagmCaaGtgmCtcGacAttTc 

GatActTttGgcAagmCtcGttmCcaAtcAagAagGagGtcAtcmCcaGatmCg 

GatGagGagGgamCacAccGagmCtcTaaAtcmCacAttmCcaAtamCagTtcAa 

mCttAtgTccGaaGatAtcmCcaGagGatTggGacAagAacmCcaGtcAagAt 

TacmCccAgtmCgamCtalgaTggAgamCagAaamCctmCgaGaaGttmCgaAg 

aAt 

inCtcGtcGccTccAacTtcAacGaaAttGccmCttGatGaaAccAagActGt 

TtcTatTgtTtaTtcmCttGccmCaaTagTgtAttTgtAttTatTctTtcTc 

mCaaAtcmCatmCtcmCcaGtgGatTtcGtcAttGctGacAagTtcGccGagGa 

GttTctGatTcgAcamCttTatGgamCcaTctmCaaGttmCtgmCgaGttTctTt 

GggAaamCaaAtgAttGttGgtAcaGtaGccmCgcmCctGctAttmCacTgtGa 

mCgaGcamCatmCatmCcaAtcGttmCctGttmCaamCaaGgcmCttmCtaAtcGtt 

Ag 

TgaTgaGagmCccAgtAacmCaaTtaTttGaamCcgTcaGgaTgtGcgTaaGg 
iTiCgtmCtaAtcGaaGaaGggGatmCgtGggmCaaTcaTaamCtaAttAacmCttm 

Ca 

mCaaTggmCtcmCagGtcTttmCtgmCtcTtcAtaTacTtcmCatTccGagTtgmCt 

GttmCtcTtgGagmCtgAagTtgTcgmCgtGctmCgtGtgAttmCtcActTctmCt 

TcgmCtamCcaGcaAggAatActTcaAcaAggTcaAcaAgtGatmCacAcaGa 

AagGaaAttGtaActmCgcmCcaAgaGctmCtcmCcaGgtGtcmCgtGgamCatAt 

TtgActGgaltgGagAttGcgGaaGaaGttGatGttGaaAtcGagAgtGg 

GccAagTctmCaaGcaAtaAgtGttGatmCaaTcaGagmCcaTacGgaGagAt 

AtaTtgAgamCttmCggGacAagmCggActTctmCatmCtgTcamCagmCaamCtg 

mCc 

GatmCcgmCagAgaAtcGagTatTtcmCtcTcgAgamCccAtgGatAtcAacTg 

TccGttAagAagmCtcActGgaAaaAcamCacGgcTcgAacGaaAttGgaAt 

AatTtgGatGagAgcAaaGtgGaaGgaAtgGctAtcGttTtgGcaGatAt 

GtgmCtgGtcAaaAaaTgcTtgmCttmCgtTgcTtaTtcGcaTtgmCacTcgmCa 

mCttmCgaGaamCtcTtcAagTtgGaaTcaAcaGtgGcaTcgGatAcamCatGa 

GtgmCctlctGaaGccGaaGaaAacGacGatlagTtaAatGttTccAagTt 
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Oligo Name 

CERAD_Y1 1 6A8C. 1 3_u289_LNA3 
CERAD_Y1 1 6A8C. 1 3_u59_LN A3 
CERAD_Y39A1A.23_u221_LNA3 
CERAD_Y39A1 A.23„u276_LNA3 
CERAD_Y41 C4A. 1 4_u509_LNA3 

CERAD_Y41C4A.14_u731_LNA3 
CERAD_Y43C5A.6_u131_LNA3 
CERAD_Y43C5A.6_u429_LNA3 
CERFC_F31 E3.3_u128_LNA3 
CERFC_F31 E3.3_u55_LNA3 
CERPL^K1 1 H1 2.2_d1_LNA3 

CERPL_K1 1 H12.2_u172_LNA3 

CERT_F36A4.7_u1396_LNA3 

CERT_F36A4.7_u2302_LNA3 

CERT_F36A4.7_u289_LNA3 

CERT__F36A4.7_u291 9_LNA3 

CERT_F36A4.7_u4269_LNA3 

CERT_F36A4.7_u5485_LNA3 

CESLC_F52F12.1a_u249_LNA3 

CESLC_F52F12.1a_u76_LNA3 
CESLC^K1 1 G9.5_u400_LNA3 
CESLC_K1 1 G9.5_u462_LNA3 
CESLC_Y32F6B.1_u179_LNA3 
CESLC_Y32F6B.1_u280_LNA3 
CESLC_Y37A1C.1a_u104_LNA3 
CESLC_Y37A1 C. 1 a_u404_LN A3 
CESLC_Y70G1 0A.3„u383_LNA3 
CESLC_Y70G1 0A.3_u46_LNA3 
CES0D_C1 5F1 .7_u435_LNA3 
CES0D_C1 5F1 .7_u9_LNA3 
CESOD_F10D1 1 .1_u326_LNA3 
CES0D_F1 0D1 1 .1_u477_LNA3 

CESULT_EEED8.2_u31 6_LNA3 
CESULT EEED8.2 u82 LNA3 


Sequence 

GatAaaAtcGatAgcGacGacGatGagGaaGccGatGatGagGagmCtcGa 

GcaGgtGgaTacGgaTgtGgaGctGacTttTgcGttTtaTcaAgaAtcTc 

TccmCgtAgaAgtAgaAatGctAgaAgaAccTgaAcaAgaAgaTcaAgaAa 

TgcAagAtgTcaGtaTtgAaamCaaTtcmCtgTagAgamCccmCcgAagAaaAt 

AgtmCtcGtaTccGggAatGttTcaGccTgtGaaAatGctTgtTgaAgamCg 

mCttmCaaAacmCgtmCgcTttTaaGgaTacAggAacGtgGcamCgcTtcmCgaG 

g 

mCagAttGtamCctTcgAaaAggAaaAggAgaGaaTcgmCgtmCgcAaaAatGg 

TgaTggmCttTgaTtaTtcGagmCagGagmCaaTgaTgtmCcgAgaGtcGttAt 

mCaaTgamCgaGaaTatTggAgtAatGggGaaActGgtTgcGacTlgmCgaAa 

TtgGaaAacAatmCtcmCtcGacTttmCtgmCtcAclmCttmCgtGaaActAtcmCa 

TctTgtTatTttAttTtgTttTggGctTgtTccGaaAatGaaAtgGttGt 

mCaaTggAtcAccAagmCcaGttmCacAagmCacmCgtGagmCaaAgaGgamCt 

cAc 

mCttTgtGatGtgAtgActGcgAagGgamCacTtgAtgGctAttAcgAgamCa 

GagmCcaGctActmCagAtgAcamCtcAacAcgTtcmCalTatGcaGgaGttTc 

TacActmCcaTccTcgmCcgAcaTacAatmCcaAcaTctmCGamCgcGgaTtcTc 

AtgGagAagAtgGttTggAtgGaaTgtGggTtgAgaAtcAgaAtaTgcmCg 

AacmCggGatAccGtgTcgAacGtcAcaTgaAagAtgGcgAtaTaaTcgTc 

GagGagAttAaamCgcAtgTcaGtgGctmCatGtcGagTttmCcaGaaGtcTa 

AgaTatTgcmCtcTacTtaTcaTggGccTgaTggmCttTgtmCtgmCcgGtaTt 

GaaTctmCaamCcamCttmCtgGaamCccmCatAcamCcaAtgGatAgaAgamC 

ggAg 

GttGttmCttTttTccGtgAtcTttTcaTgtTtaTgtmCtgAacGtgGcaGg 

GacTcgTtgGtgTctTgcTagGatGtcTtgGgtTcaTtcmCtcAatmCgtTg 

GtamCtgGgcTcgAggGctGaaActAatmCgaAgaAgaAacTccAgaAgaTa 

GgaTcaTgcTctGttTacGacActGatGagTtaAgaGtcAgamCtgmCacGt 

mCgaTggTtcTtcTcgTctAtcAtaTcgGggTagTtgmCcgAagTgtTgaAa 

mCaaAtcGaamCtgGtaTaaAggAggAccGacGgaGacGaaTttGaamCgaGa 

AttmCgaTcaAagAacTctGgcTctmCggmCgtTaamCtgGacAttTgtTcgTc 

mCtcmCccGagmCagGcgAttAttmCacGctAgtTatGctmCaaAtgTgaTctGt 

mCcgGtamCtaTctGgaTcamCacAgaAgtmCcgAaaAtgAccAggmCagTtaTt 

mCccAgtGacTacmCtgAatmCgcGtcTctGaaTctmCcamCacAatTccTacTa 

GgaGttGctmCacmCgcAatTaaGagmCgamCttmCggAtcTctGgaTaaTctTc 

AaaTtgAggAaaAgcTtcAcgAggmCggTctmCcaAagGaaAcgTcaAagAa 

mCaaTcgTacmCatGaaAgaAgtTggAagmCcamCgtGcaAgaGaaGaaAtcmC 

a 

AagAagAttmCctGacmCagAgaGacTcamCgtGctTacmCcaAgaAgcAtcTa 


246 


Oligo Name 

CESULT_Y1 13G7A.1 1_u252_LNA 
3 

CESULT_Y1 13G7A.1 1_u96_LNA3 

CESULT_Y67A1 0A.4_u1 08_LNA3 

CESULT_Y67A1 0A.4_u327_LNA3 

CETOPO^KI 2D1 2. 1_,u398_.LNA3 

CETOPO_K12D12.1_u449_LNA3 

CETOPO_M01 E5.5b_u256_LNA3 

CETOPO_M01 E5.5b_u429_LNA3 

CEUbLF25B5.4_u1 86_LNA3 

CEUbLF25B5.4_u2_LNA3 

CEUbi_F29B9.6_u145_LNA3 

CEUbi_F29B9.6_u230_LNA3 

CEUbLM7.1_u239_LNA3 

CEUbLM7.1_u53_LNA3 

CEUGT_F39G3.1^u40_LNA3 

CEUGT_F39G3.1_u466_LNA3 

CEUGT^MBB. 1 _u480_LN A3 

CEUGT_M88.1_u72_LNA3 

YAL009W_u145_LNA3 

YAL009W_u341_LNA3 

YAL059W_u262_LNA3 

YAL059W_u51_LNA3 

YER109C_u109_LNA3 

YER109C_u436_LNA3 

YHR152W_u128_LNA3 

YHR152W_u510_LNA3 

YKL130C_u211_LNA3 

YKL130C_u85_LNA3 

YKL178C_u199_LNA3 

YKL178C_u367_LNA3 

YLR443W_u179_LNA3 

YLR443W_u86_LNA3 
YOR092W_u251_LNA3 
YOR092W u82 LNA3 


Sequence 

AgcAttGgtGgaAatAcgAaaTggmCatGggAagAgaAacmCccTctmCaaTt 

mCtgGttAcgGtaGtgTatGgtmCccTgtmCctmCtcAgaAtgmCaaAtaTgtmCg 

TctAcgTcgAtgGaaAagmCcgAttTaamCaaTcaAagmCcaAcaAcgmCagTt 

GgaAagGtgmCcaAaaAgtTgamCagmCaaTtgGagGatmCttAttmCatTgcmCa 

AgaTgaTgaTgaAgtTccTgcAaaGaaGccTgcTccAgcGaaGaaAgcTg 

AaaAccTcgTacTggAaaAggAgcTgcGaaAgcGgaAgtTatmCgaTttGt 

GagAagGccmCagAagAagTacGacAgamCtgAagGagmCagTtgAaaAagTt 

TtcTgtmCatAcaAtcGtgmCtaAtcGgcAggTtgmCgaTccTttGtaAccAt 

AagmCttmCggAcamCcaTtgAgaAtgTcaAagmCcaAaaTccAggAtaAggAg 

AatmCgaAccmCatmCaaTtcActmCgtTatTccTccTcg'j.tcTccGttmCaaGt 

mCtgAacmCatmCcaAatAttGaaGatmCcaGctmCagOctGaaGccTatmCagAt 

mCgtGtgmCttAtcTctTctGgaTgaAaamCaaGgaltgGaaGccGtcAatmCt 

mCggAagmCatmCtgmCctTgamCatTctmCcgTtcGcaGtgGtcGccGgcTctG 

AaaGtamCgcTatGtgAggAggmCtaAcamCcaTtcAtaTaaGaamCgcAgcmCa 

TgtTgcmCgtAgaAgaGagActAaaActAagAacGatTgaTtgAagGtcTg 

TacAatTctTtgmCagGaaGcaAtaTccGccGgaGtcmCccmCttAtcActAt 

mCtcAcgGagGttAtaAttmCtaTgcAggAggmCaaTttmCtgmCtgGagTtcmCa 

AccGttTcaTgaGagmCtgTaaTcaGgtGttGttTctGtaAaaAgtGtgAa 

GtgGatGtgAaaTtaGtcmCtcAacmCccAgaGcaTttAgtGcaGagAttAg 

GcaGttTaaTgtGaaGctAgtTaaAgtAcaGtcTacGtgGgamCgaGaaAt 

AttGccAagTccAttTctmCgtGccAagTacAttmCaaAatAcaAgaAagGc 

AgamCtcmCtamCaaAtaGatTcgGtgTccTgcmCagAcgAtgTtgAagAatAg 

TtgAagTttGggAalAttGgtAtgGttGaaGacmCaaGgamCcgGatTacGa 

GagGcgmCaaGtaGgcAatGatTcaAgaAgtAgtAaaGgcAatmCgtAacAc 

TgaGcamCaaAgtTaaGatGttmCggAaaGaaAaaGaaAgtmCaaTccTatGa 

mCaaGtgAccAatmCagmCacGcamCggmCttmCcaTccTcaAgamCtgAtaTta 

mCc 

AttAaalgcGcaGatGagGacGgaAcgAatAtcGgaGaaActGatAatAt 

GatGgtAagmCtgAgcGccTtgGacGaaGaaTttGatGttGtcGctActAa 

TacGtcAcgmCaaGgamCagAgcTttGacGacGaaAtaTcamCttGgaGgaTt 

TctmCccTgtGtaGgtAcamCcaAtaTcamCaaGcgmCatTtcTatGtcGacTa 

TgcTaamCacmCagTttAgamCcaTggAaaTccmCacmCgcAaaTatAagmCaa 

Tg 

GcaGgamCatAagAttmCcgGtcAagmCaamCgamCagTgaAgaAagTatGcaA 
a 

mCcgTctAgtGaaAgcGggAtgGctAaaTtgGgaAaamCgamCaaGatGttAt 
GatGctTcaAtaTccTttGatGgtmCgtTagTttAccAttTttGgtGtcTt 
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YPL263C_u132_LNA3 


YPL263C u257 LNA3 


OMgo Name 


Sequence 

AgtmCatTtgAgtTatGtgAagAccGttGgtGggAaaGaaGagAtcAggTg 
GtcTtgGctAccAcamCccAaaAccGttmCgaAacTttAagAgcAttmCtamCt 


Example 54. Evaluation of different LNA substituted oligonucleotides as probes for 
fluorescence in situ hybridization (FISH) on metaphase chromosomes and interphase 


5 


nuclei. 


Locked Nucleic Acids (LNA) constitute a novel class of DNA analogues that have an 
exceptionally high affinity towards complementary DNA and RNA. Using human classical 
satellite-2 repeat sequence clusters as targets, we demonstrate that LNA/DNA mixmer 
oligonucleotides are excellent probes for FISH combining high binding affinity with short 

10 hybridization time and even with the ability to hybridize without prior termal denaturation of the 
template. The development of molecular probes and image analysis has made fluorescence in 
situ hybridization (FISH) a powerful investigative tool. Although FISH has proved to be a useful 
technique in many areas, it is a fairly time-consuming procedure with limitations in sensitivity. 
Probes with higher DNA affinity may potentially reduce the time needed for hybridization and 

1 5 the sensitivity of the technique. Thus, improvement in hybridization characteristics has been 
reported for the DNA mimic peptide nucleic acid (PNA). This example describes the 
development of LNA substituted oligonucleotides as probes for fluorescence in situ 
hybridization on metaphase chromosomes and interphase nuclei. In each experiment a different 
LNA substituted oligonucleotide of the same 23-bp human satellite-2 repeat sequence 

20 (attccattcgattccattcgatc) have been used, cf Jeanpierre, M. (1994). Human satellites 2 and 3. 
Annals of Genetics 37, 163-171, Oligomers with various LNA content, different labels, and 
hybridization conditions have been used and compared with each other and the optimal 
conditions have been determined for an efficient LNA-FISH protocol. 
A. MATERIALS AND METHODS 

25 Al. Chromosome Preparations 

Chromosome preparations were made by standard methods from peripheral lymphocyte 
cultures of normal males. Slides were prepared 1-4 days prior to an experiment and treated with 
RNAse (10|lg/|ll) at 37^C for one hour before hybridization. 
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A2. Probe preparation 

The 23bp human satellite-2 repeat sequence, attccattcgattccattcgatc, was used to prepare 
the LNA/DNA mixmers with different content and sequence order of LNA modifications (Table 
19). All mixmers were labeled in the 5' end with either Cy3 or biotin. Biotin amidite was 
5 purchased from Applied Biosystems and Cy3 amidite was purchased from Amersham 

Bioscience. A DNA oligonucleotide of the same sequence without any LNA modifications was 
used as a control in each experiment. 
A3. Fluorescence in situ hybridization 

FISH was carried out as described by Silahtaroglu AN, Hacihanefioglu S, Guven GS, 

10 Cenani A, Wirth J, Tommerup N, Tumer Z.(1998) Not para-, not peri-, but centric inversion of 
chromosome 12. J Med Genet. 35(8):682-4. (1); with the following modifications including: The 
amounts of probe were 6.4, 10, 13.4 and 20 pmoles. Denaturation of the target DNA and the 
probe were performed at 75oC for 5 minutes either separately using 70% formamide or 
simultaneously under the coverslip in the presence of the hybridization mixture containing 50% 

15 formamide. In addition the effect of denaturation was also tested. Two alternative hybridization 
mixtures were used: 50% formamide/2xSSC (pH 7.0) 710% dextran sulphate or 2xSSC (pH 7.0) 
710% dextran sulphate. Hybridization times included 30 min, 1 hr, 2 hrs, 3 hrs and overnight. 
Hybridization temperatures included: 37°C, 55°C, 60°C and 72°C. Post washing was either as 
for standard FISH (1), or with 50% formamide/2xSSC at 60°C, or without formamide. 

20 Hybridization signals with biotin labeled LNA substituted oligonucleotide probes were 

visualized indirectly using two layers of fluorescein-labeled avidin (Vector Labs) linked by a 
biotinylated anti-avidin molecule, which amplified the signal 8-64 times. The hybridization of 
Cy3 labeled molecules however, was visualized directly after a short washing procedure. 
Slides were mounted in Vectashield (Vector Laboratories) containing 4'-6'-diamidino-2- 

25 phenylindole (DAPI). The whole procedure was carried out in the dark. The signals were 

visualized using a Leica DMRB epifluorescence microscope equipped with a SenSys charge- 
coupled device camera (Photometries, Tucson, AZ), and IPLAB Spectrum Quips FISH software 
(Applied Imaging intemational Ltd, Newcastle, UK) within two days after hybridization. 
B. RESULTS AND DISCUSSION 

30 Satellite-II DNA, composed of multiple repeats of a 23bp and a 26bp sequence, is 

especially concentrated in the large heterochromatic regions of human chromosomes 1 and 16, 
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but is also found in the heterochromatic regions of chromosomes 9, Y, 15 and in other minor 
sites like the short arms/satellites of the acrocentric chromosomes and some centromeric regions. 
Classical satellite DNA can be visualised by FISH with traditional genomic and DNA 
oligonucleotide probes (see Kokalj-Vokac N, Alemeida A, Gerbault-Seureau M, Malfoy B, 

5 Dutrillaux B. (1 993) Two-color FISH characterization of i(l q) and der(l ; 1 6) in human breast 
cancer cells. Genes Chromosomes Cancer, 7, 8-14; and Tagarro I, Femandez-Peralta AM, 
Gonzales- Aguilera J J. (1994) Chromosomal localization of human satellites 2 and 3 by a FISH 
method using oligonucleotides as probes. Hum Genet. 93(4):383-8). Due to this and the presence 
of distinct major and minor sites of satellite-2 DNA in the genome, we used the 23-bp satellite-2 

10 repeat sequence, attccattcgattccattcgatc, as a convenient model to test the efficiency of various 
DNA/LNA mixmers for FISH analysis and the effect of different experimental conditions by 
recording the number, location and strength of signals on each metaphase. To compare the 
efficiency of mixmers with different LNA content (Table 19) and to optimize the LNA-FISH 
protocol, different conditions were tried at each step of a standard FISH protocol as described in 

15 Materials and Methods. All LNA substituted oligonucleotides (LNA/DNA mixmer 

oligonucleotides) for human satellite-2 sequence gave very prominent signals when used as FISH 
probes. In general, the signal on chromosome 1 was always stronger and appeared earlier, 
followed by signals on chromosomes 16, 9, Y, 15, other acrocentric chromosomes and the 
centromeric regions of other chromosomes, respectively (Figure 41). In general, biotin labeled 

20 mixmers gave stronger signals with a higher background, whereas Cy3-labeled molecules gave a 
significantly lower background. 

Bl. Effect of LNA content of the LNA substituted oligonucleotides (LNA/DNA mixmers) 

The LNA-2 molecule which had every other nucleotide modified as LNA. 
(aTtCcAtTcGaTtCcAtTcGaTc (SEQ ID NO: )) gave the best results in all the experiment 

25 performed. The LNA-3 molecule, with every third oligonucleotide modified as LNA, 

(aTtcCatTcgAtTccAttCgaTc (SEQ ID NO: )) also gave hybridization signals, but with less 
efficiency than the LNA-2 probes. Preferably, an LNA-2 oligonucleotide molecule has an LNA 
unit at every other nucleotide position in the sequence and an LNA-3 oligonuclotide molecule 
has an LNA unit at every third position of the sequence. However, minor deviations, e.g. in one 

30 position or less than 5-10 percent of the nucleotide positions in the sequence may still provide 
the general features of an LNA-2 or an LNA-3 molecule. 
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The Dispersed LNA (aTtccatTcgaTtccAttcgaTc (SEQ ID NO: )), which had 5 dispersed 
LNA modifications, was less efficient in short term hybridization, but gave signals on both 
chromosomes 1 and 16 after overnight hybridizations. LNA/DNA mixmers with 3 LNA Blocks 
(aTTCcattcgATTccattcGATc (SEQ ID NO: )) was comparably inferior as a FISH-probe. 
5 B2. Effect of amount of the LNA/DNA mixmers 

The initial experiments performed with 20 pmol of LNA/DNA mixmer resulted in bright 
and large signals, but with an extremely high background. Thus, lower concentrations were 
tested (13.4 pmol, 10 pmol and 6.4 pmol). The concentration giving the optimal signal to noise 
ratio was found to be 6.4 pmol. 
1 0 B3. Effect of denaturation 

The signals on the major sites of hybridization (Iq, 16q) were equally bright after both 
types of denaturation. However, smaller and weaker signals were observed on the minor sites 
with the simultaneous denaturation protocol. To check the potential "strand invasion" property of 
LNA, some of the experiments were performed without a denaturation step. As expected, no 
15 signals were obtained by the control DNA oligonucleotide probe. In contrast, hybridization 

signals on chromosomes 1 and 1 6 were observed after overnight hybridization with LNA probes, 
with LNA-2 mixmer giving the best signals. Compared to the signals obtained in experiments 
involving a denaturation step, the signals were smaller, but prominent and without any 
background. 

20 B4, Effect of hybridization time, temperature and post-hybridization washes 

Although signals could be observed after only 30 min of hybridization, the optimal 

hybridization time and temperature for LNA-2, which gave the best signals, was 1 hr at 37^C. 

A 3x5 min wash with 0.1xSSC/60''C and 4xSSC/0.05%Tween/37'*C, respectively, followed by a 

5 min PBS wash was found to be sufficient for washing the slides after hybridization with DNA- 
25 LNA mixmers. There was no specific difference between a wash with 50% formamide at 42^C or 

60^C. 


The signals faded away in most of the slides within two days. When hybridized with 
directly labeled LNA, the whole slide was stained with Cy3 after three days. Thus, slides had to 
30 be analyzed within 48 hours after hybridization. 
C. CONCLUSION 
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The experiments have demonstrated, that LNA substituted oHgonucleotides are very 
efficient FISH probes. LNA substituted oligonucleotide probes gave strong signals after only 1 
hr of hybridization, and it was possible to omit the use of formamide both from the denaturation 
and from the post hybridization washing steps and still obtain a very good signal to noise ratio. 
5 The ability of LNA to hybridize without prior denaturation could be due to a strand invasion 
property of LNA and this warrants further investigation with other LNA probes and at different 
conditions.Based on the combined results of these experiments, the optimal LNA-FISH 
procedure was defined as follows: 6.4 pmoles of Cy-3 labeled LNA-2 probe was denatured 
together with the target at 75°C for 5 minutes, and hybridized for one hour then followed by a 

10 short post wash without any formamide (3x 5 minutes O.lxSSC at 60^C; 2x Sminutes 
4xSSC/0.05%Tween at 3TC; 5 minute PBS). The FISH experiments indicate that LNA 
containing probes would be valuable for the detection of a variety of other repetitive elements, 
such as centromeric a-repeats and telomeric repeats. In addition, the superior hybridization 
characteristics of LNA containing oligonucleotides could lead to detection of single base pair 

15 differences between repetitive sequences as well as single copy sequences. 

CI. Figure 41 shows a comparison of different LNA/DNA mixmer oligonucleotides. Experiment 
conditions: 6.4 pmoles of Cy3 labeled probe was hybridized for 30 minutes at 37^C, after 
simultaneous denaturation of the target and the probe at 75^C for 5 minutes. A. LNA-2 giving 
signals on chromosomes 1, 16, 9 and 15, B. LNA-3 giving bright signals on chromosomes 1,16 

20 and 9, C. Dispersed LNA giving signals on chromosomes 1 and 16 only, D. LNA Block giving 
smaller signals on chromosome 1, E. DNA oligo giving no signals on any of the chromosomes. 


Table 19. DNA/LNA mixmers for human satellite 2 repeat sequence used in this study. (SEQ ID 
NOs:) 

FISH probe sequences 
attccattcgattccattcgatc 
aTtccatTcgaTtccAttcgaTc 
aTtcCatTcgAtTccAttCgaTc 
aTTCcattcgATTccattcGATc 
ATtCcAtTcGaTtCcAtTcGaTc 


Name 
DNA oligo 
Dispersed LNA 
LNA-3 
LNA Blocks 
LNA-2 


LNA monomers Tm* 

0 60 

5 71 

8 77 

9 73 
11 84 


25 
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LNA modifications are depicted in capital letters and *Tm values for each molecule have been 
calculated using Exiqon's Tm Prediction program accessible at http://lna-tm.com/ and as appears 
in Figs. 19A-19F herein. 


S Example S5. Highly efficient fluorescence in situ hybridization (FISH) using an LNA probe 
specific for human telomere repeat 

1. Chromosome Preparations 

Chromosome preparations were made by standard methods from peripheral lymphocyte cultures 
of two normal males. Slides were prepared 1-6 days prior to an experiment and treated with 
1 0 RNAse {\0\ig/\i\) at 3TC for one hour before hybridization. 

2. FISH probe preparation 

A Cy3-labelled, LNA-2 design of the 24-bp telomere sequence (ttagggttagggttagggttaggg) 
representing 4 blocks of 6-bp telomere repeat (ttaggg) was used as a probe. A DNA oligomer of 
the same sequence without any LNA modifications was used as a control in each experiment. 

15 3. Fluorescence in situ hybridization 

FISH was carried out as described previously (Silahtaroglu et al, 1998) with the following 
modifications. The amount of probe was Spmoles. Denaturation of the target DNA and the probe 
were performed at 75^C for 5 minutes simultaneously under the coverslip in the presence of 
hybridization mix containing 50% formamide. Slides were washed after 30 min. hybridization at 

20 37 ^C. Post washing steps included a 2x5min O.IXSSC at 60 ''C; 5min 2XSSC at 3TC; 3min 
4XSSC/0.05%Tween20 at 37*'C and 5min PBS. Slides were mounted in Vectashield (Vector 
Laboratories) containing 4'-6'-diamidino-2-phenylindole (DAPI). The whole procedure was 
carried out in the dark. The signals were visualized using a Leica DMRB epifluorescence 
microscope equipped with a SenSys charge-coupled device camera (Photometries, Tucson, AZ), 

25 and IPLAB Spectrum Quips FISH software (Applied Imaging international Ltd., Newcastle, 
UK). 

4. Results 

The human telomere repeat specific LNA oligonucleotide probe gave prominent signals on the 
telomeres, when used as a FISH probe (Fig. 42), whereas no signals could be detected with the 
30 corresponding DNA control probe when using the hybridization conditions specified above. 
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Thus, the experiments described here for human telomere repeat, demonstrates that LNA 
substituted oHgonucleotides are highly efficient as FISH probes. 
5. References 

Silahtaroglu, A.N., Hacihanefioglu, S., Guven, G.S., Cenani, A., Wirth, J., Tommerup, N., 
5 Tumer, Z.(1998) Not para-, not peri-, but centric inversion of chromosome 12. Journal of 
Medical Genetics 35(8), 682-684. 

Example 56. Fluorescence in situ hybridization using chromosome-21 specific centromere 
LNA probes. 
10 1. Chromosome Preparations 

Chromosome preparations were made by standard methods from peripheral lymphocyte cultures 
of a normal female. Slides were prepared 5 days prior to use. Before use slides were treated with 
RNAse A (10p.g/^il) at 37®C for one hour and proteinase K for 10 minutes washed, with 2xSSC 3 
times 3 min, before dehydration through a cold ethanol series. 

15 

2. Probe preparation 

A 5' biotin-labelled, LNA substituted 15-mer FISH probe (aCcCaGcCaAaGgAg (SEQ ID NO: ), 
LNA uppercase, DNA lowercase) and a 5' biotin-labelled LNA substituted 24-mer FISH probe 

(TgTgTaCcCaGcCaAaGgAgTtGa (SEQ ID NO: ), LNA uppercase, DNA lowercase) specific for 
20 the centromeric human chromosome 21 alphaRI(680) locus alpha-satellite repeat were used as 
probes. Biotin-labelled DNA probes of the same sequence without any LNA modifications were 
used as controls in each experiment. 

3. Fluorescence in situ hybridization 

25 FISH was carried out as described previously (Silahtaroglu et al, 1998) with the following 

modifications. The amount of probe was l|iM for Ithe 15-mer chromocsome 21 FISH probe and 
1 .4 |iM for the 24.mer FISH probe. Denaturation of the target DNA and the probe were 
performed simultaneously at 79^C for 4 minutes under the coverslip in the presence of 
hybridization mix containing 50% formamide. Slides were washed after 40 min. hybridization at 

30 RT. Post washing steps included a 2x5min O.IXSSC at 65 ^C; 5min 3min 

4XSSC/0.05%Tween20 at 3TC. Slides are then incubated lOmin with 1% blocking reagent and 
a layer of Flourescein conjugated Avidin (Vector Labs) has been applied for 20 minutes at ZTC. 
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After a 3 times 3 minute wash with 4XSSC/0.05% Tween20, slides are dehydrated and mounted 
in Vectashield (Vector Laboratories) containing 4'-6'-diamidino-2-phenylindole (DAPI). The 
whole procedure was carried out in the dark. The signals were visualized using a Leica DMRB 
epifluorescence microscope equipped with a SenSys charge-coupled device camera 
5 (Photometries, Tucson, AZ), and IPLAB Spectrum Quips FISH software (Applied Imaging 
international Ltd., Newcastle, UK). 

4. Results 

The LNA substituted 15-mer oligonucleotide probe specific for the centromeric human 
chromosome 21 alphaRI(680) locus alpha-satellite repeat gave prominent signals on 
chromosome 21, when used as a FISH probe, whereas no signals could be detected with the 

•J 

corresponding DNA control probe when using the hybridization conditions specified above. The 
LNA substituted 24-mer oligonucleotide probe specific for the centromeric alphaRI(680) locus 
alpha- satellite repeat gave prominent signals both on chromosomes 13 and 21, when used as a 
FISH probe, while no signals were observed with the DNA control probe. This is expected, since 
the aforementioned chromosomes differ only at one nucleotide position in the given probe 
sequence. On the other hand, the results obtained by the 15-mer LNA FISH probe clearly 
demonstrates that the LNA substituted probe is capable of discriminating a single mismatch 
between chromosomes 13 and 21 in the centromeric alpha-satellite repeat. Thus, the experiments 
described here for the centromeric repeat-specific LNA probes in the human chromosome 21, 
demonstrates that LNA substituted oligonucleotides are highly efficient as FISH probes and can 
be used in diagnosis of chromosome 21 trisomy. 

Other Embodiments 

25 From the foregoing description, it will be apparent that variations and modifications may 

be made to the invention described herein to adopt it to various usages and conditions. The 
foregoing description of the invention is merely illustrative thereof, and it understood that 
variations and modifications can be effected without departing fi-om the scope or spirit of the 
invention. 

30 All publications, patent applications, and patents mentioned in this specification are 

herein incorporated by reference to the same extent as if each independent publication, patent, or 
patent application was specifically and individually indicated to be incorporated by reference. 
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Other embodiments are in the claims. 
What is claimed is: 
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